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METHODS FOR GENERATING ANTIBIOTIC RESISTANT MICROBES 
AND NOVEL ANTIBIOTICS 

FIELD OF THE INVENTION 

This invention relates to the field of antimicrobial treatments and gene targets for the 
discovery of antimicrobial agents. In particular, it relates to the discovery of genes essential 
for growth and virulence of bacteria. 

BACKGROUND OF THE INVENTION 

Despite the development of new classes of antimicrobial agents over the past decade 
(reviewed in http://vet.purdue.edu/bms), microbial infections remain a serious health problem. 
While antibiotics treatment has been effective in controlling infectious diseases, an increase 
hi the number of antibiotic-resistant (AR) microbes have emerged and are now posing a major 
therapeutic problem. In today's industrialized societies, infectious strains can be found that are 
resistant to all classes of antimicrobial agents used in the clinic. Infections due to resistant 
strains include higher morbidity and mortality, longer patient hospitalization, and an increase 
in treatinent costs (Murray (1994) New Engl. J. Med. 330:1229-1230). In light of these 
findings, an unmet need exists for the development of new therapeutic agents that can work by 
inhibiting the ever-increasing number of novel antibiotic resistance mechanisms. 

One approach for generating new therapies and/or therapeutic strategies against AR 
microbes is to develop methods that can generate a wide array of genomic alterations in a 
microbe's genome that can yield maximal number altered target genes that are capable of 
eliciting antibiotic resistance. Once an AR strain is developed, it can be used for rapid 
genome analysis to identify mutant gene(s) that are capable of rendering a microbe resistant to 
an antibiotic for target identification. Such analysis can involve any of a variety of methods 
used by those skilled in the art for identifying mutations and/or differential gene expression. 
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including but not limited to differential gene expression using microarrays, cDNA subtraction, 
differential protein analysis, complementation assays, single nucleotide polymorphosm (SNP) 
analysis or whole genome sequencing to identify altered loci. 

A bottleneck to generating genetically diverse microbes is the inability to generate 
nonbiased genome-wide mutations. Many mutagenesis methods are available whereby the use 
of chemical and radiation exposure has been successfiil in generating genomic mutations. A 
limitation of this approach is that these various methods are usually DNA site specific or are 
extremely toxic, therefore limitmg the mutation spectra and the opportunity to identify a 
maximal number of genes, when mutated, that are able to confer resistance to an antibiotic. 
Recently, work done by Nicolaides, et al. (Nicolaides et al (1998) Mol. Cell. Biol 18:1635- 
1641; U.S. patent 6,146,894) has demonstrated the utility of mtroducing dominant negative 
mismatch repair mutants into cells to confer global DNA hypermutability. These mutations 
are in the form of point mutations or small insertion-deletions that are distributed equally 
throughout the genome. The ability to manipulate the mismatch repair (MMR) process of a 
target host organism can lead to an increase in the mutability of the target host genome, 
leading to the generation of innovative cell subtypes with varying phenotypes from the 
original wild-type ceUs. Variants can be placed under a specified, desired selective process 
the resuh of which is the capacity to select for a novel organism that expresses an altered 
biological molecule(s) and has a new phenotype. The concept of creating and introducing 
dominant negative allele of a gene, including the MMR alleles, in bacterial cells has been 
documented to result in genetically altered prokaryotic mismatch repair genes (Aronshtam and 
Marinus (1996) Nucl Adds Res. 24:2498-2504; Wu and Marinus (1994) J. Bacterial. 
176:5393-400; Brosh and Matson (1995) J. Bacteriol. 177:5612-5621). Furthermore, altered 
MMR activity has been demonstrated when MMR genes from different species including 
yeast and mammalian cells are over-expressed (Fishel et al. (1993) Cell 7:1027-1038; Lipkin 
et al (2000) Nat. Genet. 24:27-35). The ability to create hypermutable organisms by blocking 
MMR has great commercial value for the generation of AR bacterial strains for drug screening 
and target discovery. 

There is an urgent need in the art to elucidate the mechanisms of antimicrobial 
resistance, and to identify novel antimicrobial agents. 
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SUMMARY 

The invention provides new uses of MMR deficiency in bacteria to identify antibiotic 
resistance (AR) genes and pathways that can lead to the generation of novel therapeutic 
strategies for alternative clinical strategies. 

Antibiotic resistant (AR) microbes express a number of genes that are essential for 
growth of the organism in an infection, and serve as useful reagents for target discovery and/or 
screening lines for the discovery of novel antimicrobial agents. This invention provides an 
approach to the identification of genes that confer anti-microbial resistance, and the use of 
those genes, and bacterial strains expressing mutant forms of genes, in the identification, 
characterization, and evaluation of targets for therapeutic development. In addition, this 
application teaches of the use of employing structural information of the gene, gene product 
and mutant strains in screening for antimicrobial agents active against the genes and their 
corresponding products and pathways. Positive compounds can then be used as final products 
or precursors to be fiirther developed into antibacterial agents. This invention also provides 
methods of treating microbial infections in mammals by administering an antimicrobial agent 
active against such an identified target gene or product, and the pharmaceutical compositions 
effective for such treatment. 

To identify genes capable of rendering bacteria antibiotic-resistant, the invention 
provides methods of decreasing MMR activity of a microbial host to produce AR strains. 
Using this process, commercially viable microbes that are resistant to a wide range of 
antibiotics can be directly selected for the resistance to an anti-microbial agent of interest. AR 
microbes may be genetically screened to identify novel therapeutic targets for drug 
develoment. Once a bacterium with a specified resistance is isolated, the MMR activity may 
be restored by several methods well known to those skilled in the art as a means to gentically 
"fix" the new mutations in the host genome, thereby making the AR microbe suitable for 
comparative molecular analysis to the wild-type strain as well as for drug screening to identify 
novel antimicrobial compounds. For example, if MA^IR is decreased by the use of a dominant- 
negative allele or antisense vector directed to an internal MMR gene, the endogenous repair 
activity can be restored if the gene is expressed by an inducible promoter system, including 
but not limited to promoters such as: TAC-LACI, tryp (Brosius et al. (1984) Gene 27:161- 
172), araBAD (Guzman et al. (1995) J. Bact 177:4121-4130) pLex (La Vallie et al. (1992) 
Bio.Technology 11:187-193), pRSET (Schoepfer, R (1993) Gene 124:83-85), pT7 (Studier 
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(1991) J. Mol Biol. 219(l):37-44) etc., by removing the inducer and, therefore, reducing the 
the promoter activity. In the case that the expression vector employs a Cre-lox system, MMR 
can be restored by disrupting the cDN A gene insert from the host cell harboring the expression 
vector (Hasan, N. et al. (1994) Gene 2:51-56). Yet other methods uiclude homologous 
knockout of the expression vector that can turn off the actively expressed gene used to inhibit 
MMR activity. In addition to the recombinant methods outUned above that have the capacity 
to eliminate the MMR activity from the microbe, it has been demonstrated that many 
chemicals have the ability to "cure" microbial cells of plasmids. For example, chemical 
treatment of cells with drugs including bleomycin (Attfield et al (1985) Antimicrob. Agents 
Chemother. 27:985-988) or novobiocin, coumercycin, and quinolones (Fu et al (1988) 
Chemotherapy 34:415-418) have been shown to result in microbial cells that lack endogenous 
plasmid as evidenced by Southern analysis of cured cells as well as sensitivity to the 
appropriate antibiotic (Attfield et al (1985) Antimicrob. Agents Chemother. 27(6):985-988, Fu 
et al (1988) Chem. Abstracts 34(5):415-418; BiWang et al (1999) J. ofFujian Agricultural 
University 28(l):43-46; Brosius, J. (1988) Biotechnology 10:205-225). Whether by use of 
recombinant means or treatment of cells with chemicals, removal of the MMR-expression 
plasmid results in the reestabUshment of a genetically stable microbial cell line. Therefore, 
the restoration of MMR allows host bacteria to function normally to repau: DNA. The newly 
generated mutant bacterial strain that exhibits a novel anti-microbial resistance is now suitable 
for gene/protem discovery to identify new biomolecules that are involved in generating 
resistance as well as a model system to screen for novel anti-microbial agents targeted against 
certain antibiotic resistant strains. 

In certain embodiments, the invention provides methods for generatmg antibiotic 
resistant bacteria comprising the steps of: 

blocking mismatch repair in the bacterium whereby tiie bacterium becomes 
hypermutable; 

contacting the bacterium with at least one antibiotic 

determining whether the bacterium is resistant to the antibiotic, thereby generating 
antibiotic resistant bacteria. 

In the methods of the invention, mismatch repair may be blocked in some 
embodiments by introducing a polynucleotide encoding a wild-type allele of a mismatch repair 
gene into a cell, whereby the wild-type allele inactivates the endogenous MMR activity by 
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binding to and interfering with the resident activity. The cell becomes hypermutable as a 
result of the introduction of the gene. 

In other embodiments of the invention, a polynucleotide encoding a dominant negative 
allele of a mismatch repair gene is introduced into a cell, where the dominant negative gene is 
derived from a mismatch repair gene from a different organism. The cell becomes 
hypermutable as a result of the introduction of the gene. In particular embodiments of this 
method, MMR activity is inhibited for ten rounds of cell division and then the MMR activity 
is restored therefore restoring the genetic stability. An example of a dominant negative MMR 
gene is the PMS2-134 gene. 

In other embodiments of the invention, a polynucleotide encoding an allele of a 
mismatch repair gene is introduced into a bacterial cell, where the mismatch repair gene is 
derived from a wild-type or altered mammalian, yeast, fungal, amphibian, insect, plant or 
bacterial mismatch repair gene. The cell becomes hypermutable as a result of the introduction 
of the gene. 

In another embodiment, mismatch repair may be blocked by introducing an antisense 
nucleic acid molecule into the bacterium wherein the antisense nucleic acid molecule 
specifically binds to a mismatch repair gene and inhibits mismatch repair in the bacterium. 

In other embodiments of the invention, methods are provided for generating a genetic 
alteration of a bacterial host genome to produce variant strains expressing new output traits. 
Transgenic bacterium comprising a polynucleotide encoding a wild-type allele of a mismatch 
repair gene is grown. The bacteria are comprised of a set of altered genes for a desired 
biological phenotype. 

In other embodiments of the mvention, methods are provided for generating a genetic 
alteration of a bacterial host genome to produce variant strains that are resistant to 
antimicrobial agents. Bacteria with decreased mismatch repair are grown. The bacteria are 
comprised of a set of altered genes for a desired antibiotic-resistance phenotype. 

In further embodiments of the invention, methods are provided for creating a 
hypermutable bacterium using a wild-type MMR allele for antibiotic-resistance selection, and 
restoring genomic stability of a selected host by inactivating or decreasing the expression of 
the wild-type MMR allele. 

In another embodiment of the invention, a method is provided for creating a 
hypermutable bacteria using a dominant negative MMR allele for antibiotic-resistance 
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selection, and restoring genomic stability of a selected host by inactivating or decreasing the 
expression of the dominant negative MMR gene allele. 

In another embodiment of the invention, a method is provided for creating a 
hypermutable bacteria expressing an amtisense gene to a MMR gene for antibiotic-resistance 
selection, and restoring genomic stability of a selected host by inactivating or decreasing the 
expression of the dominant negative MMR gene allele. 

in another embodiment of the invention, a method is provided for creating a 
hypermutable bacteria using chemical inhibitors of MMR for antibiotic-resistance selection, 
and restoring genomic stability of a selected host by removing the chemical inhibitor, 
by introducing a dominant negative allele of a mismatch repair gene into the bacterium. The 
dominant negative allele may be, for example, a PMS2-134 gene. 

In another embodiment, mismatch repair may be blocked by exposing the bacterium a 
to a compound that inhibits mismatch repair whereby cells are grovra in the presence of the 
compound and undergo mxiltiple rounds of cell divison in the absence of MMR, yielding sibs 
that are genetically diverse. Sibs are then selected for antibiotic resistance. AR strains are 
removed from chemical inhibitor and the endogenous MMR activity is restored leaving 
genetically stable strains that are now suitable for gene discovery and/or therapeutic agent 
development. For example, the compound that blocks mismatch repair may be an anthracene 
derivative, including, but not limited 1,2-dimethylanthracene, 9,10-dimethyl anthracene, 7,8- 
dimethylanthracene, 9,10-diphenylanthracene, 9,10-dihydroxymethylanthracene, 9- 
hydroxymethyl-lO-methylanthracene, dimethylanthracene-l,2-diol, 9-hydroxymethyl-lO- 
methylantbracene-l,2-diol, 9-hydroxymethyl-l0-methylanthracene-3,4-diol, 9, 10-di-m- 
tolyanthracene. In other embodiments, the compound that blocks MMR activity is an ATP 
analog. In other embodiments, the compound that blocks MMR activity is a nuclease inhibitor. 
In other embodiments, the compound that blocks MMR activity is a DNA polymerase 
inhibitor. 

The methods of the invention may further comprise exposing the bacteria to chemical 
mutagens. While it has been documented that MMR deficiency can lead to as much as a 1000- 
fold increase in the endogenous DNA mutation rate of a host, there is no assurance that MMR 
deficiency alone will be sufficient to alter every gene within the DNA of the host bacterium to 
create altered biochemicals with new activity(s). Therefore, the use of chemical agents and 
their respective analogues such as methane sulfonate, dimethyl sulfonate, 0-6-methyl 
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benzadine, ethylnitrosourea (ENU), ethidium bromide, ethyl methanesulfonate (EMS), N- 
methyl-N'-nitro-N-nitrosoguanidine (MNNG), methylnitrosourea (MNU), Tamoxifen, 8- 
hydroxyguanine, as well as others listed but not limited to in publications by: Khromov- 
Borisov etal. {\999)Mutat. Res. 430:55-74; Ohe etal. (l999)Mutat. Res. 429:189-199; Hour 
et al. (1999) Food Chem. Toxicol. 37:569-579; Hrelia et al. (1999) Chem. Biol. Internet. 
1 1 8:99-11 1; Garganta et al. (1999) Environ. Mol. Mutagen. 33:75-85; Ukawa-Ishikawa et al 
(1998) Mutat. Res. 412:99-107; www.ehs.utah.ed-u/ohb/mutagens. etc. can be used in the 
methods of the mvention to fttrther enhance tiie spectrum of mutations and increase the 
Ukelihood of obtaining alterations in one or more genes that can in turn generate host bacteria 
witii a complex antibiotic resistant phenotype (Fu et al. (1988) Chemotherapy 34(5):415-418; 
Lee et al. (1994) Mutagenesis 9:401-405; Vidal et al. (1995) Carcinogenesis 16:817-821). 
Prior art teaches us that mismatch repaur deficiency leads to hosts with an increased resistance 
to toxicity by chemicals with DNA damaging activity. This feature allows for the creation of 
additional genetically diverse hosts when MMR defective bacteria are exposed to such agents, 
which would be otherwise impossible due to the toxic effects of such chemical mutagens 
(Colella etal. (1999) Br. J. Cancer 80:338-343; Moreland et al. (1999) Cancer Res. 59:2102- 
2106; Humbert et al. (1999) Carcinogenesis 20:205-214; Glaab etal. (1998) Mutat. Res. 
398:197-207). Moreover, prior art teaches us that MMR is responsible for repairing chemical- 
induced DNA adducts, so therefore blocking this process could theoretically increase the 
number, types, mutation rate and genomic alterations of a bacterial host [Rasmussen et al. 
(1996) Carcinogenesis 17:2085-2088; Sledziewska-Gojska a/. {1991) Mutat. Res. 383:31- 
37; Janion et al. (1989) Mutat. Res. 210:15-22). In addition to the chemicals listed above, 
other types of DNA mutagens include ionizing radiation and UV-irradiation, which are known 
to cause DNA mutagenesis in bacteria can also be used to potentially enhance this process. 
These agents, which are extremely toxic to host cells and, therefore, result in a decrease in the 
actual pool size of altered bacterial cells, are more tolerated in MMR defective hosts and in 
turn allow for a enriched spectrum and degree of genomic mutation ( Drummond et al. (1996) 
J. Biol. Chem. 271:9645-19648). such as, but not limited to methane sulfonate, dimethyl 
sulfonate, 0-6-methyl benzadine, ethylnitrosourea, ethidium bromide, ethyl methanesulfonate, 
N-methyl-N-nitro-N-nitrosoguanidine, methyhiitrosourea, Tamoxifen, and 8-hydroxyguanme. 

The methods of the invention may be used to generate AR bacteria which are resistant 
to such antibiotic compounds as, for example, quinilones, ammoglycosides, magainins. 
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defensins, tetracyclines, beta-lactams, macrolides, lincosamide, sulfonamides, 
chloramphenicols, nitrofurantoins, and isoniazids. 

In the methods of the invention, the step of determining whether the bacterium is 
resistant to an antibiotic may comprise analyzing the bacterium for multiantiboitic resistance. 

Further, the methods of the invention may comprise making antibiotic resistant bacteria 
genetically stable, such as by removing the MMR inhibitory molecule, for example. 

In the methods of the invention, the genome of the antibiotic resistant bacterium and 
the genome of a wild-type strain of the bacterium may be compared by sequence analysis of 
the entire genomes, or compared by microarray analysis, for example. In another embodiment, 
the genome of said antibiotic resistant bacterium and the genome of said wild-type strain of 
said bacterium are compared by: 

introducing gene fragments from the antibiotic resistant bacterium into the wild-type 
bacteriimi, thereby producing mutant bacteria; 

selecting a mutant bacterium with antibiotic resistance; and sequencing the gene 
fragment from the mutant bacterium with antibiotic resistance, thereby identifymg the 
antibiotic resistant gene. 

The invention also provides methods of using microbial strains that are naturally 
defective for MMR due to defects in genes encoding for MMR proteins. Strains in which 
mutS. mutL, mutH, or inutY genes are defective have been reported to be defective in MMR 
activity (Modrich (1994) Science 266: 1959-1960). The methods of the invention may employ 
bacterial strains with mutant endogenous MMR genes for selecting for variants that are AR. 
Once an AR variant strain is identified, the genetic stability of the microbe can be restored by 
expressing a functional gene that can complement the defective MMR gene activity. 

Mutant strains can be used for gene identification by isolating DNA fragments derived 
from the MMR defective antibiotic-resistant strains. These bacteria contam DNA fragments 
with altered sequences that can be infroduced into wild-type counterparts (antibiotic 
susceptible) and screened for fragments that confer antibiotic resistance. Conversely, DNA 
fragments derived from the wild-type bacteria can be introduced into mutant bacterial sti:ams 
to screen for genes effective via loss-of-function mutated genes. The fact that a clone is 
complemented suggests the introduced fragment contains a gene encoding for an antibiotic- 
resistant gene product. Other metiiods can also be used to identify AR genes including but not 
limited to microarray analysis of gene expression, differential expression and/or differential 
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pxotein analysis know by those skilled ia the art. 

The microbial strains described herein have either been generated and characterized in 
a manner which essentially provides a process by which the manipulation of MMR can confer 
AR against a wide range of anti-microbial compounds and that these strains are now useful for 
target discovery and/or therapeutic agent discovery as screening lines. 

In other embodiments of the invention, methods of producing a stable bacterium 
expressing a new phenotype is provided. Turning off the expression of the MMR- wild-type 
alleles, MMR-dominant negative alleles, or MMR-antisense alleles, results in genetically 
stable bacteria expressing a new output trait(s). 

The invention also provides antibiotic resistant strains of bacteria produced by the 
methods of the invention. 

These and other aspects of the invention provide the art with methods that can generate 
eiihanced mutability in bacteria as well as providing prokaiyotic organisms harboring 
potentially usefiil mutations to generate novel output traits for commercial applications, and 
are set fortli in greater detail below. 

BRIEF DESCRIPTION OF THE FIGURE 

Figure 1 shows growth of tetracychn-resistant mutant bacteria carrying a dominant 
negative allele of PMS2 in the pT7Ea plasmid {134/V5), tetracyclin-resistant mutant bacteria 
carrying a the PMSR3 gene in the pTTEa plasmid (R3), and wild-type bacteria carrying the 
empty pT7Ea plasmid (T7), on medium containmg tetracyclin at 0, 4 and 6 hours after 
tetracycline addition. 

DETAILED DESCRIPTION OF THE INVENTION 

The inventors present a method for developing hypermutable bacteria by altering the 
activity of endogenous mismatch repair (MMR) activity of hosts to generate antibiotic 
resistant (AR) microbes for target discovery and the development of novel anti-microbial 
agent by screening for new compounds. Wild-type and some domuiant negative alleles of 
mismatch repair genes, when introduced and expressed m bacteria, increase the rate of 
spontaneous mutations by reducmg the effectiveness of the endogenous MMR-mediated DNA 
repair activity, thereby rendering the bacteria highly susceptible to genetic alterations due to 
hypermutability. Hypermutable bacteria can then be utilized to screen for novel mutations in a 
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gene or a set of genes that produce variant siblings exhibiting new ou^ut traits not found in 
the wild-type cells such as antibiotic resistance. 

The process of mismatch repair, also called mismatch proofreading, is an 
evolutionarily highly conserved process that is carried out by protein complexes described in 
cells as disparate as prokaryotic cells such as bacteria to more complex mammalian cells 
(Modrich (1994) Science 266: 1959-1960; Strand et al. (1993) Nature 365:274-276; Su et al. 
(1988) J. Biol. Chem. 263:6829-6835; Aronshtam.and Marinus (1996) Nucl. Acids Res. 
24:2498-2504; Wu and Marinus (1994) J. Bacteriol 176:5393-400). A mismatch repair gene 
is a gene that encodes one of the proteins of such a mismatch repair complex. Although not 
wanting to be boimd by any particular theory of mechanism of action, a mismatch repair 
complex is beheved to detect distortions of the DNA helix resulting from non-complementary 
pairing of nucleotide bases. The non-complementary base on the newer DNA strand is 
excised, and the excised base is replaced with the appropriate base that is complementary to 
the older DNA strand. In this way, cells eliminate many mutations that occur as a result of 
mistakes in DNA replication, resulting in genetic stability of the siblmg cells derived from the 
parental cell. 

Some wild-type MMR gene alleles as well as dominant negative alleles cause a 
mismatch repair defective phenotype even in the presence of a wild-type MMR gene allele in 
the same cell. An example of a dominant negative allele of a MMR gene is the human gene 
hPMS2-134, which carries a truncation mutation at codon 134 (Nicolaides et al (1998) Mol. 
Cell. Biol. 18:1635-1641). The mutation causes the product of this gene to abnormally 
terminate at the position of the 134th amino acid, resulting in a shortened polypeptide 
containing the N-terminal 133 amino acids. Such a mutation causes an increase in the rate of 
mutations, which accumulate in cells after DNA repUcation. Expression of a dominant 
negative allele of a mismatch repair gene results in impairment of mismatch repair activity, 
even in the presence of the wild-type allele. Any mismatch repair allele, which produces such 
effect, can be used in this mvention. In addition, the use of over-expressed wild-type MMR 
gene alleles from human, mouse, plants, and yeast in bacteria has been shown to cause a 
dominant negative effect on the bacterial hosts MMR activity (Fishel et al. (1993) Cell 
7:1027-1038; Aronshtam and Marinus (1996) Nucl. Acids Res. 24:2498-2504; Wu and 
Marinus (1994) J. Bacteriol. 176:5393-400; Lipkin et al. (2000) Nat. Genet. 24:27-35). 
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Dominant negative alleles of a mismatch repair gene can be obtained from the cells of 
btumans, animals, yeast, bacteria, plants or other organisms. Screening cells for defective 
mismatch repair activity can identify such alleles. Mismatch repair genes may be mutant or 
wild-type. Bacterial host MMR may be mutated or not. The term bacteria used in this 
application include any organism from the prokaryotic kingdom. These organisms include 
genera such as but not limited to Agrobacterium. Anaerobacter. Aqtiabacterium, 
Azorhizobium, Bacillus, Bradyrhizobium, Cryobacterium, Escherichia, Enterococcus, 
Heliobacterium, Klebsiella, Lactobacillus, Methanococcus, Methanothermobacter, 
Micrococcus, Mycobacterium, Oceanomonas, Pseudomonas, Rhizobium, Staphylococcus, 
Streptococcus, Streptomyces, Tliennusaquaticus. Thermaerobacter, Thermobacillus, etc. 
Other procaryotes that can be used for this application are listed at 

rwww.bacterio.cict.fi:/validgenericnamesy Bacteria exposed to chemical mutagens or radiation 
exposure can be screened for defective mismatch repair. Genomic DNA, cDNA, or raRNA 
from any cell encoding a mismatch repair protein can be analyzed for variations from the 
wild-type sequence. Dominant negative alleles of a mismatch repair gene can also be created 
artificially, for example, by producing variants of the hPMS2-134 allele or other mismatch 
repair genes (Nicolaides et ah (1998) Mol Cell Biol 18:1635-1641). Various techniques of 
site-directed mutagenesis can be used. The suitabiUty of such alleles, whether natural or 
artificial, for use in generating hypermutable bacteria can be evaluated by testing the mismatch 
repair activity (using methods described in Nicolaides et al. (1998) Mol Cell Biol 18:1635- 
1 641) caused by the allele in the presence of one or more wild-type alleles, to determine if it is 
a dominant negative allele. 

A bacterium that over-expresses a wild-type mismatch repair allele or a dominant 
negative allele of a mismatch repair gene will become hypermutable. This means that the 
spontaneous mutation rate of such bacteria is elevated compared to bacteria without such 
alleles. The degree of elevation of the spontaneous mutation rate can be at least 2-fold, 5-fold, 
10-fold, 20-fold, 50-fold, 100-fold, 200-fold, 500-fold, or 1000-fold that of the normal bacteria 
as measured as a ftmction of bacterial doubling/minute. 

According to one aspect of the invention, a polynucleotide encoding either a wild-type 
or a dominant negative form of a mismatch repair protein is iatroduced into bacteria. The 
gene can be any dominant negative allele encoding a protein which is part of a mismatch 
repair complex, for example, mutS, mutL, mutH, or mutYhomologs of the bacterial, yeast. 
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plant or mammalian genes (Modrich (1994) Science 266:1959-1960; Prolla et al. (1994) 
Science 264: 1091-1093). The dominant negative allele can be naturally occurring or made in 
the laboratory. The polynucleotide can be in the form of genomic DNA, cDNA, RNA, or a 
chemically synthesized polynucleotide or polypeptide. The molecule can be introduced into 
the cell by transfection or other methods well described in the literature. 

Transfection is any process whereby a polynucleotide or polypeptide is introduced into 
a cell. The process of transfection can be carried out in a bacterial culture using a suspension 
culture. The bacteria can be any type classified vmder the prokaryotes. 

In general, transfection will be carried out using a suspension of cells but other 
methods can also be employed as long as a sufficient fraction of the treated cells incorporate 
the polynucleotide or polypeptide so as to allow transfected cells to be grown and utilized. 
The protein product of the polynucleotide may be transiently or stably expressed in the cell. 
Techniques for transfection are well known to those skilled in the art. Available techniques to 
introduce a polynucleotide or polypeptide into a prokaryote include but are not limited to 
electroporation, transduction, cell fusion, the use of chemically competent cells (e.g., calcium 
chloride), and packaging of the polynucleotide together with lipid for fusion with the cells of 
interest. Once a cell has been transformed with the inhibitory mismatch repair gene or protein, 
the cell can be propagated and manipulated in either liquid culture or on a solid agar matrix, 
such as a petri dish. If the transfected cell is stable, the gene will be retained and expressed at 
a consistent level when the promoter is constitutively active, or when in the presence of 
appropriate inducer molecules when the promoter is inducible, for many cell generations, and 
a stable, hypermutable bacterial strain results. 

An isolated bacterial cell is a clone obtained from a pool of a bacterial culture by 
chemically selecting out strams using antibiotic selection of an expression vector. If the 
bacterial cell is derived from a single cell, it is defined as a clone. 

Bacterial cultures may be screened for antibiotic resistance against a wide array of 
antibiotic compounds. For example, but not by way of limitation, bacteria produced by the 
methods of the invention may be screened for resistance to quinilones, aminoglycosides, 
magainins, defensins, tetracyclines, beta-lactams, macroUdes, lincosamide, sulfonamides, 
chloramphenicols, nitrofurantoins, and isoniazids. The antibiotics vasiy be incorporated into 
soUd or liquid growth medium, for example. 
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A polynucleotide encoding an inhibitory fomi of a mismatch repair protein can be 
introduced into the genome of a bacterium or propagated on an extra-chromosomal plasmid. 
Selection of clones harboring the mismatch repair gene expression vector can be accomplished 
by addition of any of several different antibiotics, including but not limited to ampicillin, 
kanamycin, chloramphenicol, zeocin, and tetracycline. The microbe can be any species for 
which suitable techniques are available to produce transgenic microorganisms, such as but not 
limited to genera including Bacillus, Pseudomonas, Staphylococcus, Escherichia and others. 

Any method for making transgenic bacteria known in the art can be used. According 
to one process of producing a transgenic microorganism, the polynucleotide is transfected into 
the microbe by one of the methods well known to those in the art. Next, the microbial culture 
is grown under conditions that select for cells in which the polynucleotide encoding the 
mismatch repair gene is either uicorporated into the host genome as a stable entity or 
propagated on a self-replicating extra-chromosomal plasmid, and the protein encoded by the 
polynucleotide fragment transcribed and subsequently translated into a functional protein 
within the cell. Once a transgenic microbe is engineered to harbor the expression construct, it 
is then propagated to generate and sustain a culture of transgenic microbes indefinitely. 

Once a stable, transgenic microorganism has been engineered to express a functional 
MMR protein, the microbe can be exploited to create novel mutations in one or more target 
gene(s) of interest harbored within the same microorganism. A gene of interest can be any 
gene naturally possessed by the bacterium or one introduced into the bacterial host by standard 
recombinant DNA techniques. The target gene(s) may be known prior to the selection or 
unknown. One advantage of employing such transgenic microbes to induce mutations in 
resident or extra-chromosomal genes within the microbe is that it is uimecessary to expose the 
microorganism to mutagenic insult, whether it be chemical or radiation in nature, to produce a 
series of random gene alterations in the target gene(s). This is due to the highly efficient 
nature and the spectrum of naturally occurrmg mutations that result as a consequence of the 
altered mismatch repair process. However, it is possible to increase the spectrum and 
frequency of mutations by the concomitant use of either chemical and/or radiation together 
with MMR defective cells. The net effect of the combuiation treatment is the increase in 
altered gene pool in the genetically altered microbe that result in an increased alteration of an 
allele(s) that are useful for producing new output traits. Other benefits of using MMR- 
defective microbes that are taught ui this application are genetic screens for the DIRECT 
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selection of variant sub-clones that exhibit new output traits with commercially important 
applications such as antibiotic resistance, which allows the bypassing of the tedious and time 
consuming gene identification, isolation and characterization stages. 

Mutations can be detected by analyzing the recombinant microbe for alterations in the 
genotype and/or phenotype post-activation of the decreased mismatch repair activity of the 
transgenic microorganism. Novel genes that produce altered phenotypes in MMR-defective 
microbial cells can be discerned by any variety of molecular techniques well known to those 
in the art. For example, the microbial genome can be isolated and a library of restriction 
fragments cloned into a plasmid vector. The library can be introduced into a "normal" cell 
and the cells exhibiting the novel phenotype screened. Transformed cells are then screened 
for the new phenotype (e.g., antibiotic resistance). A plasmid is isolated fi-om those normal, 
transformed cells that exhibit the novel phenotype and the inserted gene(s) characterized by 
DNA sequence analysis. 

Alternatively, differential messenger RNA screen can be employed utilizing driver and 
tester RNA (derived from wild-type and novel mutant respectively) followed by cloning the 
differential transcripts and characterizing them by standard molecular biology methods well 
known to those skilled in the art. Furthermore, if the mutant sought is on encoded by an 
extrachromosmal plasmid, then following co-expression of the dominant negative MMR gene 
and the gene of interest to be altered and phenotypic selection, the plasmid is isolated from 
mutant clones and analyzed by DNA sequence analysis by methods well known to those in the 
art. 

In another embodiment, the screening of cells may be performed by microarray 
analysis. Li microarray analysis, microchips containing all or a subset of all expressed 
bacterial genes may be screened using RNA molecules derived from the wild-type or 
antibiotic resistant strain whereby RNA derived from one strain is reverse transcribed using 
FluoroLink Cy3 and the other RNA sample is reverse transcribe-labelled using Cy5 dUTP. 
Labelled cDNAs from each organism are used to probe the microchip whereby unique 
message from one source will generate a distinct signal while message expressed from both 
sources will generate a common fluorescence. Alternatively, microchips containing 
olignucleotide derived from the wild-type strain can be used to hybridize genomic fragments 
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&om the antibiotic resistant strain to identify fragments containing a mutated gene by loss of 
hybridization. 

Phenotypic screening for output traits in MMR-defective mutants can be by 
biochemical activity and'or a physical phenotype of the altered gene product. A mutant 
phenotype can also be detected by identifying alterations in electrophoretic mobility, DNA 
binding in the case of transcription factors, spectroscopic properties such as IR, CD, X-ray 
crystallography or high field NMR anafysis, or other physical or structural characteristics of a 
protein encoded by a mutant gene. It is also possible to screen for altered novel function of a 
protein in situ, in isolated form, or in model systems. One can screen for alteration of any 
property of the microorganism associated with the function of the gene of interest, whether the 
gene is known prior to the selection or unknown. The aforementioned screemng and selection 
discussion is meant to illustrate the potential means of obtaining novel mutants with 
commercially valuable output traits. 

' Plasmid expression vectors that harbor the mismatch repair (MMR) gene mserts can be 
used ui combmation with a number of commercially available regulatory sequences to control 
bolh the temporal and quantitative biochemical expression level of the dominant negative 
MMR protein. The regulatory sequences can be comprised of a promoter, enhancer or 
promoter/enhancer combmation and can be inserted either upstream or downstream of the 
MMR gene to control the expression level. The regulatory promoter sequence can be any of 
those well known to those in the art, includuxg but not lunited to the lad, tetracycline, 
tryptophan-inducible, phosphate mducible, T7-polymerase-inducible (Studier et al. (1991) J. 
Mol. Biol. 219(l):37-44), and steroid inducible constructs as well as sequences which can 
result ui the excision of the dominant negative mismatch repair gene such as those of the Cre- 
Lox system. These ty^jes of regulatory systems have been listed in scientific publications and 
are familiar to those skilled in the art. 

Once a microorganism with a novel, desired output trait of mterest is created, the 
activity of the aberrant MMR activity is attenuated or eliminated by any of a variety of 
methods, including removal of the inducer from the culture medium that is responsible for 
promoter activation, gene disruption of the aberrant MMR gene constructs, electroporation 
and or chemical curmg of the expression plasmids (Brosius(1988) Biotechnology 10:205-225; 
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Wang et al. (1999) J. ofFujian Agricultural University 28:43-46; Fu et. al (1988) Chem, 
Abstracts 34:415-418). The expression of the dominant negative MMR gene will be turned on 
to select for hypermutable microbes with new output traits. Next, the expression of the 
dominant negative dominant negative MMR allele is rapidly turned off to reconstitute a 
genetically stable strain that produces a new output trait of commercial interest. The resulting 
microbe is now useful as a stable strain that can be applied to various commercial applications, 
depending upon the selection process placed upon it. 

In cases where genetically deficient mismatch repair bacteria (strains such as but not 
limited to: Ml (mutS) and in EC2416 (mutS delta umuDC), and mutL or mutY strains) are 
used to derive new output traits, transgenic constructs will be used that express wild-type 
mismatch repair genes sufficient to complement the genetic defect and therefore restore 
mismatch repair activity of the host after trait selection (Grzesiuk et al (1988) Mutagenesis 
13:127-132; Bridges et al. (1997) EMBO J. 16:3349-3356; LeClerc (1996) Science 15:1208- 
1211; Jaworski, A. et al. (1995) /Voc. Natl. Acad. Sci USA 92:11019-11023). The resulting 
microbe is genetically stable and can be applied to various commercial practices. 

The use of over expressing foreign mismatch repair genes from human and yeast such 
as human PMSl (SEQ ID N0:7), human PMS2 (SEQ ID N0:5), mouse PMS2 (SEQ ID 
N0:3), human MSH2 (SEQ ID N0:9), human liLHl (SEQ ID NO: 1 1), yeast MLHl (SEQ ID 
NO: 1), human MLH3 (SEQ ID NO:28), as well as the other homologs identified in other 
species for these encoded polypeptides efc.have been previously demonstrated to produce a 
dominant negative mutator phenotype in bacterial hosts (Brosh and Matson (1995) J. 
Bacteriol. 177:5612-5621; Studamiie et al. (1998) Mo/. Cell. Biol 18:7590-7601; Alani et al 
(1997) Mol Cell 5io/. 17:2436-2447). In addition, the use of bacterial strains expressing 
prokaryotic dominant negative MMR genes as well as hosts that have genomic defects in 
endogenous MMR proteins have also been previously shown to result in a dominant negative 
mutator phenotype (Strand et al (1993) Nature 365:274-276; Nicolaides et al (1998) Mol 
Cell Biol 18:1635-1641). However, the findings disclosed here teach the use of MMR 
genes, including the human PMSR2 and PMSR3 gene (Nicolaides et al (1995) Genomics 
30:195-206); the related PJ\f573^ truncated MMR gene (Nicolaides et al{\99%) Mol Cell 
Biol 18:1635-1641); the plant mismatch repair genes (derived &om Arabidopsis tlialiand). 
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ATPMS2 (SEQ ID NO:30). At PMSl (SEQ ID NO:32), and M/f^'honiolog (SEQ ID NO:34) 
and those genes that are homologous to the 134 N-terminal amino acids of the PMS2 gene 
which include the MutL family of MMR proteins and including the PMSR and PMS2L 
homologs described by Hori et al. (PMS2L8 (SEQ ID NO:36) and PMS2L9 (SEQ ID 
NO:38)) andNicolaides (Nicolaides etal. (1995) Genomics 30:195-206) to create 
liypermutable microbes. The corresponding polypeptide sequences for the above-referenced 
nucleic acid sequences are as follows; yeast MLHl (SEQ ID N0:2); mouse PMS2 (SEQ ID 
N0:4); human PMS2 (SEQ ID N0:6); human PMSl (SEQ ID N0:8); human MSH2 (SEQ ID 
N0:10); human MLHl (SEQ ID N0:12); PMS2-134 (SEQ ID N0:14); human MSH6 (SEQ 
ID NO: 16); human PMSR2 (SEQ ID NO: 18); human PMSR3 (SEQ ID NO:20); human 
PMSL9 (SEQ ID NO:22); human MLH3 (SEQ ID NO:29); ATPMS2 (SEQ ID N0:3 1); 
ATPMSl (SEQ ID NO:33); At MutS homolog (SEQ ID NO:35); PMS2L8 (SEQ ID NO:37); 
and PMS2L9 (SEQ ID NO:39). 

In addition, the invention provides the use of DNA mutagens in combination with 
MMR defective microbial hosts to enhance the hypermutable production of genetic alterations. 

. This has not been demonstrated in the art previously as a means to accentuate MMR activity 
for generation of microorganisms with clinically relevant output traits such as antibiotic 

^resistance. 

In some embodiments of the mvention, the bacteria cells are rendered hypermutable by 
introducing a chemical inhibitor of mismatch repair into the growth medium. Chemical 
inhibitors of mismatch repair that may be used to generate hypermutable bacterial cells 
include anthracene-derived compounds comprising the formula: 



In certain preferred embodiments of the invention, the anthracene has the formula: 
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10 R4 


wherein Ri-Rio are independently hydrogen, hydroxyl, amino, alkyl, substituted alkyl, alkenyl, 
substituted alkenyl, aUcynyl, substituted aUcynyl, 0-alkyl, S-alkyl, N-alkyl, O-alkenyl, S- 
alkenyl, N-alkenyl,0-alkynyl, S-alkynyl, N-alkynyl, aryl, substituted aryl, aryloxy, substituted 
aryloxy, heteroaryl, substituted heteroaryl, aralkyloxy, arylalkyl, alkylaryl, alkylaryloxy, 
arylsulfonyl, alkylsulfonyl, alkoxycarbonyl, aryloxycarbonyl, guanidino, carboxy, an alcohol, 
an amino acid, sulfonate, alkyl sulfonate, CN, NO2, an aldehyde group, an ester, an ether, a 
crown ether, a ketone, an organosulfur compound, an organometallic group, a carboxyUc acid, 
an organosilicon or a carbohydrate that optionally contains one or more alkylated hydroxyl 
groups; 

wherein said heteroaUcyl, heteroaryl, and substituted heteroaryl contain at least one 
heteroatom that is oxygen, sulfur, a metal atom, phosphorus, silicon or nitrogen; and 

wherein said substituents of said substituted alkyl, substituted alkenyl, substituted 
alkynyl, 

substituted aryl, and substituted heteroaryl are halogen, CN, NO2, lower aUcyl, aryl, heteroaryl, 
aralkyl, aralkyloxy, guanidino, alkoxycarbonyl, alkoxy, hydroxy, carboxy and amino; 

and wherein said amino groups optionally substituted with an acyl group, or 1 to 3 aryl 
or lower alkyl groups; 

or wherein any two of Ri-Rio can together form a polyether; 

or wherein any two of Ri-Rio can, together with the intei-vening carbon atoms of the 
anthracene core, form a crown ether. 

As used herein, "aUcyl" refers to a hydrocarbon containing from 1 to about 20 caibon 
atoms. Alkyl groups may straight, branched, cyclic, or combinations thereof. AUcyl groups 
thus include, by way of illustration only, methyl, ethyl, propyl, isopropyl, butyl, isobutyl, 
cyclopentyl, cyclopentylmethyl, cyclohexyl, cyclohexyhnethyl, and the like. Also mcluded 
within the definition of "alkyl" are fused and/or polycyclic aliphatic cyclic ring systems such 
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as, for example, adamantane. As used herein the term "alkenyl" denotes an alkyl gioup 
having at least one carbon-carbon double bond. As used herein the term "alkynyl" denotes an 
alkyl group having at least one carbon-carbon triple bond. In certain preferred embodiments of 
the invention, the anthracene has the formula: 


wherein Ri-Rjo are independently hydrogen, hydroxyl, amino, alkyl, substituted allcyl, alkenyl, 
substituted alkenyl, allcynyl, substituted alkynyl, 0-alkyl, S-alkyl, N-alkyl, 0-alkenyl, S- 
alkenyl, N-alkenyl,0-alkynyl, S-alkynyl, N-alkynyl. aryl. substituted aryl, aryloxy, substituted 
aryloxy, heteroaryl, substituted heteioaryl, aralkyloxy, aiylalkyl, alkylaryl, alkylaryloxy, 
arylsulfonyl, alkylsulfonyl, alkoxycarbonyl, aryloxycarbonyl, guanidino, carboxy, an alcohol, 
an amino acid, sulfonate, alkyl sulfonate, CN, NO2, an aldehyde group, an ester, an ether, a 
crown ether, a ketone, an organosulfur compound, an oi^anometallic group, a caxboxylic acid, 
an organosiUcon or a carbohydrate that optionally contains one or more alkylated hydroxyl 


wherein said heteroalkyl, heteroaryl, and substituted heteroaryl contain at least one 
heteroatom that is oxygen, sulfor, a metal atom, phosphorus, silicon or nitrogen; and 

wherein said substituents of said substituted alkyl, substituted alkenyl, substituted 
alkynyl, 

substituted aryl, and substituted heteroaryl are halogen, CN, NO2, lower alkyl, aryl, heteroaryl, 
aralkyl, aralkyloxy, guanidino, alkoxycarbonyl, alkoxy, hydroxy, carboxy and amino; 

and wherein said amino groups optionally substituted with an acyl group, or 1 to 3 aryl 
or lower alkyl groups; 

or wherein any two of Ri-Rm can together form a polyether; 

or wherein any two of Ri-Rjo can, together with the mtervening carbon atoms of the 
anthracene core, form a crown ether. 



groups; 
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As used herein, "alkyl" refers to a hydrocarbon containing from 1 to about 20 carbon 
atoms. Alkyl groups may straight, branched, cyclic, or combinations thereof. Alkyl groups 
thus include, by way of illustration only, methyl, ethyl, propyl, isopropyl, butyl, isobutyl, 
cyclopentyl, cyclopentylmethyl, cyclohexyl, cyclohexylmethyl, and the like. Also included 
within the definition of "alkyl" are fused and'or polycyclic aliphatic cyclic ring systems such 
as, for example, adamantane. As used herein the tema "alkenyl" denotes an alkyl group 
having at least one carbon-carbon double bond. As used herein the term "aUsynyl" denotes an 
alkyl group having at least one carbon-carbon triple bond. 

In some embodiments, the anthracene has the formula: 


wherein Ri-Rio are independently hydrogen, hydroxyl, amino, alkyl, substituted aUcyl, alkenyl, 
substituted alkenyl, alkynyl, substituted alkynyl, 0-alkyl, S-alkyl, N-alkyl, 0-alkenyl, S- 
alkenyl, N-alkenyl,0-alkynyl, S-aUcynyl, N-alkynyl, aryl, substituted aryl, aryloxyj substituted 
aiyloxy, heteroaryl, substituted heteroaryl, aralkyloxy, arylalkyl, alkylaryl, alkylaryloxy, 

arylsulfonyl, alkylsulfonyl, alkoxycarbonyl, aryloxycarbonyl, guanidino, carboxy, an alcohol, 
an amino acid, sulfonate, alkyl sulfonate, CN, NO2, an aldehyde group, an ester, an ether, a 
crown ether, a ketone, an organosulfur compound, an organometalUc group, a carboxylic acid, 
an organosilicon or a carbohydrate that optionally contains one or more alkylated hydroxyl 
groups; 

wherein said heteroaLkyl, heteroaryl, and substituted heteroaryl contain at least one 
heteroatom that is oxygen, sulfur, a metal atom, phosphorus, silicon or nitrogen; and 

wherein said substituents of said substituted alkyl, substituted alkenyl, substituted 
alkynyl, 

substituted aryl, and substituted heteroaryl are halogen, CN, NO2, lower alkyl, aryl, heteroaryl, 
aralkyl, aralkyloxy, guanidino, alkoxycarbonyl, alkoxy, hydroxy, carboxy and amino; 
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and wherein said amino groups optionally substituted with an acyl group, or 1 to 3 aiyl 
or lower alkyl groups. 

Examples of such anthracenes include, but are not limited to 1,2-dimethylantbi-acene, 
9,10-dimethyl anthracene, 7,8-diniethylantkacene, 9,10-diphenyIanthracene, 9,10- 
dihydroxymethylanthracene, 9-hydroxymethyl- 1 0-methylanthracene, dimethy [anthracene- 1 ,2- 
diol, 9-hydroxymethyl- 1 0-methylanthracene- 1 ,2-diol, 9-hydroxymethyl- 1 0-methylanthracene- 
3,4-diol, and 9, 10-di-m-tolyanthracene. 

As used herein the term "anthracene" refers to the compound anthracene. However, 
when referred to in the general sense, such as "anthracenes," "an anthracene" or "the 
anthracene," such terms denote any compound that contains the fused triphenyl core structure 
of anthracene, i.e.. 


regardless of extent of substitution. 

In some embodiments, the allcyl, aikenyl, alkynyl, aryl, aryloxy, and heteroaryl 
substituent groups described above may bear one or more further substituent groups; that is, 
they may be "substituted". In some prefened embodiments these substituent groups can 
include halogens (for example fluorine, chlorine, bromine and iodine), CN, NO2, lower alkyl 
groups, aryl groups, heteroaryl groups, araUcyl groups, aralkyloxy groups, guanidino, 
alkoxycarbonyl, alkoxy, hydroxy, carboxy and amino groups. In addition, the alkyl and aryl 
portions of aralkyloxy, arylalkyl, arylsulfonyl, alkylsulfonyl, alkoxycarbonyl, and 
aiyloxycarbonyl groups also can bear such substituent groups. Thus, by way of example only, 
substituted alkyl groups include, for example, alkyl groups fluoro-, chloro-, bromo- and 
iodoalkyi groups, aminoalkyl groups, and hydroxyalkyl groups, such as hydroxymethyl, 
hydroxyethyl, hydroxypropyl, hydroxybutyl, and the like. In some preferred embodiments 
such hydroxyalkyl groups contain from 1 to about 20 carbons. 

As used herein the term "aryl" means a group having 5 to about 20 carbon atoms and 
which contains at least one aromatic ring, such as phenyl, biphenyl and naphthyl. Prefeixed 
aryl groups include unsubstituted or substituted phenyl and naphthyl groups. The term 


wo 03/012130 


PCT/USOl/23888 


-22- 

"aryloxy" denotes an aryl group that is bound through an oxygen atom, for example a phenoxy 
group. 

In general, the prefix 'Tietero" denotes the presence of at least one hetero (i.e., non- 
carbon) atom, which is in some preferred embodiments independently one to three 0, N, S, P, 
Si or metal atoms. Thus, the term "heteroaryl" denotes an aryl group in which one or more 
ring carbon atom is replaced by such a heteroatom. Preferred heteroaryl groups include 
pyridyl, pyrimidyl, pyrrolyl, furyl, tliienyl, and imidazolyl groups. 

The term "aralkyl" (or "arylalkyl") is intended to denote a group having from 6 to 15 
carbons, consisting of an alkyl group that bears an aryl group. Examples of aralkyl groups 
include benzyl, phenethyl, benzhydryl and naphthyhnethyl groups. 

The term "alkylaryl" (or "alkaiyl") is intended to denote a group having from 6 to 15 
carbons, consisting of an aryl group that bears an alkyl group. Examples of araUcyl groups 
include methylphenyl, ethylphenyl and methybaphthyl groups. 

The term "arylsulfonyl" denotes an aryl group attached through a sulfonyl group, for 
example phenylsulfonyl. The term "alkylsulfonyl" denotes an alkyl group attached through a 
sulfonyl group, for example methylsulfonyl. 

The term "alkoxycarbonyl" denotes a group of formula -C(=0)-0-R where R is alkyl, 
alkenyl, or alkynyl, where the alkyl, alkenyl, or aflcynyl portions thereof can be optionally 
substituted as described herein. 

The tenn "aryloxycarbonyl" denotes a group of formula -C(=0)-0-R where R is aryl, 
where the aryl portion thereof can be optionally substituted as described herein. 

The terms "arylalkyloxy" or "aralkyloxy" are equivalent, and denote a group of 
formula -O-R'-R'', where R' is R is alkyl, alkenyl, or alkynyl which can be optionally 
substituted as described herem, and wherein r" denotes a aryl or substituted aryl group. 

The terms "alkylaryloxy" or "alkaryloxy" are equivalent, and denote a group of 
formula -O-R'-R", where R' is an aryl or substituted aryl group, and R" is alkyl, alkenyl, or 
alkynyl which can be optionally substituted as described herein. 

As used herein, the term "aldehyde group" denotes a group that bears a moiety of 
formula -C(=0)-H. The term "ketone" denotes a moiety containing a group of formula -R- 
C(=0)-R=, where R and R= are independently alkyl, alkenyl, alkynyl, aryl, heteroaryl, aralkyl, 
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or alkaryl, each of which may be substituted as described herein. 

As used herein, the term "ester" denotes a moiety having a group of formula -R- 
C(=0)-0-R= or -R-0-C(=0)-R= where R and R= are independently alkyl, alkenyl, alkynyl, 
aryl, heteroaiyl, aralkyl, or alkaryl, each of which may be substituted as described herein. 

The term "ether" denotes a moiety having a group of formula -R-0-R= or where R and 
R= are independently alkyl, alkenyl, alkynyl, aryl, heteroaryl, aralkyl, or alkaryl, each of 
which may be substituted as described herein. 

The term "crown ether" has its usual meaning of a cyclic ether containing several 
oxygen atoms. As used herein the term "organosulfur compound" denotes aliphatic or 
aromatic sulfur containing compoimds, for example thiols and disulfides. The term 
"organometallic group" denotes an organic molecule containing at least one metal atom. 

The term "organosiUcon compound" denotes aliphatic or aromatic silicon containing 
compounds, for example alkyl and aryl silanes. 

The term "carboxyUc acid" denotes a moiety having a carboxyl group, other than an 
amino acid. 

As used herein, the term "amino acid" denotes a molecule containing both an amino 
group and a carboxyl group. In some preferred embodiments, the amino acids are a-, j- or 
5-amino acids, including their stereoisomers and racemates. As used herein the term "L- 
amino acid" denotes an OE-amino acid having the L configuration around the ce-carbon, that is, 
a carboxylic acid of general formula CH(C00H)(NH2Kside cham), having the L- 
configuration. The term "D-amino acid" similarly denotes a carboxylic acid of general 
formula CH(CO0H)(NH2)-(side chain), having the D-configuration around the ce-carbon. 
Side chains of L-amiuo acids include naturally occurring and non-naturally occurring 
moieties. Non-naturally occurring (i.e., unnatural) amino acid side chains are moieties that are 
used in place of naturally occurring amino acid side chains in, for example, amino acid 
analogs. See, for example, Lehninger, Biochemistiy, Second Edition, Worth Publishers, Inc, 
1975, pages 72-77, incorporated herein by reference. Amino acid substituents may be 
attached through their carbonyl groups through the oxygen or carbonyl carbon thereof, or 
through their amino groups, or through ftinctionalities residing on their side chain portions. 
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In some embodiments of the methods of the invention, the cells are made 
hypennutable using ATP analogs capable of blocking ATPase activity required for MMR. 
MMR reporter cells are screened with ATP compound libraries to identify those compounds 
capable of blocking MMR in vivo. Examples of ATP analogs that are useful in blocking MMR 
activity include, but are not limited to, nonhydrolyzable forms of ATP such as AMP-PNP and 
ATP[gamma]S block the MMR activity (GaHo, L. et al (1999) Nucl. Acids Res. 27:2325- 
2331; Allen, D.J. et al. (1997) EA4B0 J. 16:4467-4476; Bjomson K.P. et al. (2000) Biochem. 
39:3176-3183). The ATPase inhibitors inhibit MMR and the cells become hypermutable as a 
result. 

In other embodiments of the methods of the invention, the cells are made hypermutable 
using nuclease inhibitors that are able to block the exonuclease activity of the MMR 
biochemical pathway. MMR reporter cells are screened with nuclease mhibitor compound 
libraries to identify compounds capable of blocking MMR in vivo. Examples of nuclease 
inhibitors that are useful in blocking MMR activity include, but are not limited to analogs of 
N-Ethylmaleimide, an endonuclease inhibitor (Huang, Y.C., et.al. (1995) Arch. Biochem. 
Biophys. 316:485), heterodimeric adenine-chain-acridine compounds, exonulcease III 
inhibitors (Belmont P, et.al., BioorgMed ChemLett (2000) 10:293-295), as well as antibiotic 
compounds such as Heliquinomycin, which have helicase inhibitory activity (Chino, M, et.al. 
J. Antibiot. (Tokyo) (1998) 51:480-486). The nuclease inhibitors mhibit MMR and the cells 
become hypermutable as a result. 

In other embodiments of the methods of the invention, the cells are made hypermutable 
using DNA polymerase inhibitors that are able to block the polymesrization required for 
mismatch-mediated repair. MMR reporter cells are screened with DNA polymerase inhibitor 
compound Ubraries to identify those compounds capable of blocking MMR in vivo. Examples 
of DNA polymerase inhibitors that are useful in blocking MMR activity include, but are not 
limited to, analogs of actinomycin D (Martin, S.J., et.al. (1990) J. Immunol. 145:1859), 
Aphidicolin (Kuwakado, K. et.al. (1993) Biochem. Pharmacol. 46:1909) l-(2'-Deoxy-2'- 
fluoro-beta-L-arabiaofiiranosyl)-5-methyluracil (L-FMAU) (Kukhanova M, et.al., Biochem 
Pharmacol {199S) 55:1181-1187), and 2',3'-dideoxyribonucleoside 5'-triphosphates (ddNTPs) 
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(Ono. K., etal., Biomed Pharmacother (1984) 38:382-389). Hie polymerase inhibitors inhibit 
MMR and the cells become hypermutable as a result. 

Bacterial cells rendered hypermutable using chemical inhibitors of MMR may be made 
genetically stable when the desired phenotype is obtained by removing the MMR inhibitory 
molecule. 

In certain embodiments, the bacterial cells are made hypermutable by introducing 
plamids that generate antisense messages wherein the antisense RNA specifically bind to 
MMR genes and prevent efficient expression of MMR proteins. Preferably, the antisense 
transcripts are at least 12 nucleotides m length and, more preferably are at least 20, 30, 40, 50 
nucleotides or more ui length. The antisense transcripts specifically bind to regions of the 
MMR gene to inhibit expression. Preferably, the antisense transcripts specifically bind to ' 
regulatory regions of the MMR gene such as to the MMR promoter region, Kozak consensus 
sequences, and the like. As used herein, "specifically bind" refers to association of nucleic 
acid strands forming complementary base pairing in Watson-Crick anangement, allowing for 
mismatches such that 100% complementarity is not required. In general, the complementarity 
will be about 85%, 90%, 95% or more. Plasmids that may be used to express an antisense 
MMR transcript include any vector generally known in the art to express antisense transcripts, 
such as for example, those found in Qian Y. et al (1998) Mutat. Res. 418(2-3):61-71. 

The above disclosure generally describes the present invention. A more complete 
understanding can be obtained by reference to the following specific examples that will be 
provided herein for purposes of illustration only, and are not intended to Imiit the scope of the 
uivention. 


Example 1: Generation of MMR defective bacteria. 

Bacterial expression constmcts were prepared using the human PMS2 related gene 
(hPMSR3) (Nicolaides et al. (1995) Genomics 30:195-206) and the human PMS134 cDNA 
(Nicolaidesera/. (1998) M?/. Cell. Biol. 18:1635-1641), both of which are capable of 
inactivatmg MMR activity and thereby mcrease the overaU frequency of genomic 
hypermutation. Moreover, the use of regulatable expression vectors will allow for 
suppression of dominant negative MMR alleles and restoration of the MMR pathway and 


wo 03/012130 


PCT/USOl/23888 


-26- 

genetic stability in hosts cells (Brosius, J. (1988) Biotechnology 10:205-225). For fliese 
studies, a plasmid encoding the hPMS134 cDNA was altered by polymerase chain reaction 
(PGR). The 5' oligonucleotide has the following sequence: 5'-acg cat atg gag cga get gag age 
teg agt-3' (SEQ ID NO:23)that includes the Ndel restriction site (cat atg). The 3'- 
oligonucleotide has the following sequence: 5'-gaa ttc tta tea cgt aga ate gag acc gag gag agg 
gtt agg gat agg ctt acc agt tec aac ctt cgc cga tgc-3' (SEQ ID NO:24) that includes an EcoRI 
site (gaattc) and the 14 amino acid epitope for the V5 antibody. The oligonucleotides were 
used for PGR under standard conditions that included 25 cycles of PGR (95°C for 1 minute, 
55°G for 1 minute, 72'^C for 1 .5 minutes for 25 cycles followed by 3 minutes at 72°C). The 
PGR fragment was purified by gel electrophoresis and cloned into pTA2.1 (InVitrogen) by 
standard clonmg methods (Sambrook et al., MOLECULAR CLONING: A LABORATORY MANUAL, 
THIRD Edition, 2001), creating the plasmid pTA2.1-hPMS 134. niepTA2.1-hPMS134 
plasmid was digested with the restriction enzyme EcoRE to release the insert (there are two 
EcoRI restriction sites in the multiple cloning site of pTA2.1 liiat flank the msert) and the 
fragment was end-filled using Klenow fragment and dNTPs. Next, the fragment was gel 
purified, digested with Ndel, and inserted in pT7-Ea (which had been digested with Ndel and 
BamHI, end-fiUedusing Klenow, and phosphatase treated). The new plasmid was designated 
pT7-Ea-hPMS134. 

The following sti-ategy, similar to that described above to clone human PMS 134, was 
used to construct an expression vector for the human related gene PMSR3. First, the hPMSR3 
fragment was amplified by PGR to introduce two restriction sites: an Ndel restriction site at 
the 5'- end, and an Eco RI site at the 3'-end of the fragment. The 5 '-oligonucleotide that was 
used for PGR has the foUowmg sequence: 5'-acg cat atg tgt cct tgg egg cct aga-3' (SEQ ID 
NO:25) that includes the Ndel restriction site (CAT ATG). The 3 '-oligonucleotide used for 
PGR has the foUowrag sequence: 5'-gaa ttc tta tta cgt aga ate gag acc gag gag agg gtt agg gat 
agg ctt acc cat gtg tga tgt ttc aga gct-3' (SEQ ID NO:26) that includes an EcoRI site (gaattc) 
and the V5 epitope to allow for antibody detection. The plasmid that contained human 
PMSR3 inpBluescript SK (Nicolaides et al. (1995) Genomics 30:195-206) was used as the 
PGR target with the hPMS2-specific oligonucleotides above. Following 25 cycles of PGR 
(95°G for 1 minute, 55''G for 1 minute, 72°G for 1.5 mmutes for 25 cycles foUowed by 3 
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minutes at 72°C). The PGR fragment was purified by gel electrophoresis and cloned into 
pTA2.1 (hiVitrogen) by standard cloning methods (Sambrook et al. Molecular CLONING: A 
Laboratory Manual, Third Edition, 2001), creating the plasmid pTA2. l-hR3. The 
pTA2.1-hR3 plasmid was next digested with the restriction enzyme EcoRJ to release the insert 
(there are two EcoRI restriction sites in the multiple cloning site of pTA2.1 that flank the 
insert) and the fragment was end-filled using Klenow fragment and dNTPs. Then, the 
firagment was gel purified, digested with Ndel, and inserted in pT7-Ea (which had been 
digested with Ndel and BamHI, end-filled using Klenow, and phosphatase treated). The new 
plasmid was designated pT7-Ea-hR3. 

BL21 cells harbor an additional expression vector for the lysozyme protein, which has 
been demonstrated to bind to the T7 polymerase in situ; this results in a bacterial strahi that 
has very low levels of T7 polymerase expression. However, upon addition of the inducer 
isopropyl-beta-D-thiogalactopyranoside (IPTG), the cells express high-levels of T7 
polymerase due to the IPTG-mducible element that drives expression of the polymerase that is 
resident within the genome of tlie BL21 cells (Studier et al. (1991) /. MoL Biol. 219(1);37- 
44). The BL21 cells are chloramphenicol resistant due to the plasmid that expresses lysozyme 
within the cell. To introduce the pT7-hPMS134 or the pT7-hPMSR3 genes into BL21 cells, 
the cells were made competent by incubatmg the cells hi ice cold 50niM CaCU for 20 minutes, 
followed by concentrating the cells and adding super-coiled plasmid DNA as describe ( 
Sambrook et al.. Molecular CLONING: A Laboratory Manual, Third Edition, Cold 
Spring Harbor Laboratory Press, 2001). Ampicillin resistant BL21 were selected on LB-agar 
plates [5% yeast extract, 10% bactotiyptone, 5% NaCl, 1.5% bactoagar, pH 7.0 (Difco)] plates 
containing 25 Mg/ml chloramphenicol and lOO/tg/ml ampicillm. The next day, bacterial 
colonies were selected and analyzed by restriction endonuclease digestion and sequence 
analysis for plasmids containing an intact pTACPMS134 or pTAC empty plasmid . 

In addition to constructing a V5-epitope tagged PMS134 construct, we also 
constructed and tested a non-epitope tagged version. This was prepared to demonstrate that 
the epitope tag did not cause the alteration of the dominant-negative phenotype that PMS134 
has on mismatch repair activity. For these studies, a BamHI restriction fragment containing 
the hPMS134 cDNA was fiUed-in using Klenow fragment and then sub-cloned into a 
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Klenow-fiUed, blmt-endedNdel-XhoI site of thepTACXAC expression vector (which 
contains the IPTG-inducible bacterial TAG promoter and ampicillin resistance gene as 
selectable marker). The Ndel-Xhol cloning site is flanked by the TACLAC promoter that 
contains the Lad repressor site followed by a Shine-Dalgamo ribosome-binding site at the 5' 
flanking region and the T1T2 ribosomal RNA terminator in the 3' flanking region. The 
TACLAC vector also contains the Lad gene, which is constitutively expressed by the TAC 
promoter. 

DHIOB bacterial cells containing die pBCSK vector (Stratagene), which constitutively 
expresses the i8-galactosidase gene and contains the chloramphenicol resistance marker for 
selection, were made competent via the CaClj method (Sambrook, et al MOLECULAR 
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 1982). This 
vector turns bacterial cells blue when grown in the presence of IPTG and X-gal that aids in the 
detection of bacterial colonies. Competent cells were transfected with the pTAC empty vector 
or the Tp1ACPMS134 vector followmg the heat-shock protocol. Transfected cultures were 
plated onto LB-agar [5% yeast extract, 10% bactotryptone, 5% NaCl, 1.5% bactoagar, pH 7.0 
(Difco)] plates containing 25 /ig/ml chloramphenicol and 100 /ig/ml ampicillin. The next day, 
bacterial colonies were selected and analyzed by restriction endonuclease digestion and 
sequence analysis for plasmids containing an intact pTACPMS134 or pTAC empty plasmid. 
Ten clones of each bacteria contaming correct empty or PMS134 inserts were then grown to 
confluence overnight in LB media (5% yeast extract, 10% bactotryptone, 5% NaCl, pH 7.0) 
containing 10 jtig/ml chloramphenicol and 50 /ig/ml ampicillin. The next day TAC empty or 
pTACPMS134 cultures were diluted 1 :4 in LB medium plus 50 /dVl IPTG (Gold 
Biotechnology) and cultures were grown for 12 and 24 hours at 37°C. After incubation, 50 jtil 
aliquots were taken from each culture and added to 150 [lis of 2X SDS buffer and cultures 
were analyzed for PMS134 protein expression by western blot. 

Western blots were carried out as follows: 50 jiils of each PMS134 or empty plasmid 
culture was directly lysed in 2X lysis buffer (60 mM Tris, pH 6.8, 2% SDS, 10% glycerol, 0.1 
M 2-mercaptoethanol, 0.001% bromophenol blue) and samples were boiled for 5 minutes. 
Lysate proteins were separated by electrophoresis on 4-20% Tris glycine gels (Novex). Gels 
were electroblotted onto hnmobilon-P (MUlipore) in 48 mM Tris base, 40 mM glycine, 
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0.0375% SDS, 20% methanol and blocked overnight at 4°C in Tris-buffered saline plus 0.05% 
Tween-20 and 5% condensed milk. Filters were probed with a rabbit polyclonal antibody 
generated against the N-terminus of the human PMS2 polypeptide (Santa Cruz), which is able 
to recognize the PMS134 polypeptide (Su et al. (1988) /. Biol. Chem. 263:6829-6835), 
followed by a secondary goat anti-rabbit horseradish peroxidase-conjugated antibody. 
Alternatively, blots were probed with an anti-V5 monoclonal antibody followed by a 
secondary goat anti-mouse horseradish peroxidase-conjugated antibody. After incubation 
with the secondary antibody, blots are developed using chemiluminescence (Pierce) and 
exposed to film to measure PMS134 expression. 

For induction of PMS gene product, 5 ml cultures of Luria Broth (LB) plus 50 jig/ml 
ampicillin were inoculated from glycerol stocks of the transformants pT7Ea (BL21), 
pT7PMS134/V5 (BL21), or pT7PMSR3 (BL21) and grown overnight at 37°C with shaking. 
200 |il of each overnight culture was inoculated in 20 ml (1:100) fresh LB broth plus 
ampicillin and grown to an ODeoo of 0.6. 20 nl of 100 mM IPTG (final concentration 0. ImM) 
was added and cultures were grown overnight. Western analysis confirmed the presence of 
'inducible PMS expression in the presence of inducer molecule (not shown). 

- 'Example 2: Generation of antibiotic resistant bacteria 

To demonstrate the ability to produce antibiotic resistant bacterial strains by inhibiting 
MMR, lO' bacterial cells expressing either the vector (pT7Ea) or pT7PMS 134^/^5 were 
inoculated into 5 ml LB broth plus the appropriate antibiotic concentrations as shown below 
(Table 1) and grown overnight at 37°C with shaking. Antibiotic concentrations were based on 
' 0.5X the minimum inhibitory concenti-ations (MIC) observed to inhibit the growth of bacteria 
constitutively expressing the mar operon (Goldman et al. (1996) Antunicrobial Agents 
Chemother. 40: 1266-1269). Titration analysis found the following amounts to be effective in 
inhibiting bacterial growth in the presence of various compounds. 
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Table 1. Half minimum inhibitory concentrations (MIC) on BL21 cells. 


DRUG 

0.5X MIC rug/ran 

Tetracycline 

4.70 

Nalidixic Acid 

7.10 

Ofloxacin 

0.13 

Norfloxacin 

0.13 

Vancomycin 

250.0 


The next day, cultures were analyzed for cell growHi in the presence or absence of antibiotics. 
Table 2 summarizes typical data from these studies. No growth was observed in bacterial 
control cells (pT7Ea), which had OD levels similar to blank culture. In contrast, significant 
culture growth was observed in pT7PMS134V5 andpT7PMSR3 (not shown) cultures grown 
in all antibiotics tested (Table 2) 


Table 2. Overnight Growth of Drug Resistant Mutants Expressing the PMS2-134. 


Drug 

pTTEa 

pT7PMS134V5 


growth 

Cell# 

growth 

Cell#(X10') 

Tetracycline 


0 

+ 

1.10 

Nalidixic Acid 


0 

+ 

0.97 

Ofloxacin 


0 

+ 

1.20 

Norfloxacin 


0 

+ 

1.40 

Vancomycin 


0 

+ 

ND 


To test the stability of antibiotic resistance, cells were replated and followed for 
growth in the presence of IX MIC concentration of antibiotic. Table 3 shows an example in 
which bacterial cells were inoculated at 1 x lO'' cells/ml and grown for 6 hours in the presence 
of tetracycline (Tet). As shown in Figure 1, pT7Ea control culture did not grow in the 
presence of Tet while pT7PMS134 and p,T7PMSR3 cultures resistant to Tet grew to 
confluence at time 4 hours after inoculation. These data demonstrate the ability to generate 
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antibiotic resistant cultures by blocking MMR and reestablishing genetically stable cultures 
that can be used for gene discovery. 

EXAMPLE 3: Genomic analysis of antibiotic resistant bacteria and target discovery. 

The ability to generate a wide degree of genomic mutation in MMR defective bacteria 
allow for the rapid analysis of the AR host's genome in comparison to the wild-type strain. 
While many methods for mutation analysis exist that are know by those skilled in the art, 
several approaches exist that allow for the screening of unknown genes as well as those that 
exist which are capable of screening for mutants within "candidate" genes that are capable of 
conferring an antiobiotic resistant phenotype. One such method includes the use of in vitro- 
coupled-translation strategies, which is a rapid method that is used to screen for mutations that 
result in truncated polypeptides (Liu et al. (1996) Nat. Med. 2: 169-174; Nicolaides et al. 
(1994) Nature 371: 75-80; Papadopoulos et al. (1994) Science 263:1625-1629; Nicolaides et 
al. (1998) Mol Cell. Biol 18:1635-1641; Alekshun, M.N. and S.B. Levy (1999) /. Bactenol. 
181:3303-3306). 

-In vitro transcription-coupled-translation. 

Linear DNA fragments containing candidate gene sequences were prepared by PGR, 
incorporating sequences for in vitro transcription and translation in the sense primer. The sense 
primer contains the leader sequence 5'-tttaatacgactcactatagggagaccaccatggnnn nnn nnn nnn 
nnn-3' (SEQ ID NO:27) where the series of "n" nuclsotides indicates sequence corresponding 
to the first 5 codons. The antisense primer consists of nucleotide sequences surrounding and 
including the natural stop codon of the gene. DNA fragments are PGR amplified using buffers 
and condions as described (Nicolaides et al. (1995) Genomics 30:195-206). Two to five 
microliters of whole bacteria are added to the PGR reaction mix and reactions are carried out 
at 95°C for 1 minute for one cycle followed by thirty cycles at 95°C for 30 sec, 52°C for 2 
minute and 72°C for 2 minutes. PGR products are then directly added to a rabbiti reticulolysate 
mixture to carry out transcription-coupled-translation (Promega). The reaction mixtures were 
supplemented with [^^S]-methionine for detection purposes. Translation reactions are 
incubated for 2 hours. After the reaction is complete, an equal volume of 2X SDS lysis buffer 


wo 03/012130 


PCT/USOl/23888 


-32- 

is added to the samples, and the samples are boiled and then loaded onto 12% NuPAGE gels 
(Novex). Gels are run at 150V, dried and exposed to autoradiography. Products that are 
smaller than the expected molecular weight of the wild-type protein (as compared to the 
control samples) are then determined to be mutant and DNA fragments are sequenced to 
confirm the presence of a frame-shift/nonsense mutation. This approach has been used to 
identify mutations ia bacterial genes that have been previously been reported to produce 
antibiotic resistance in bacteria. 

Discussion 

The results described above lead to several conclusions. The inhibition of MMR 
results in an increase in hypermutability in bacteria. This activity is due to the inhibition of 
MMR biochemical activity in these hosts. This invention provides a novel method of 
producing antibiotic resistant strains for target discovery and the rational design of novel anti- 
microbial agents to each target identified by generating AR bacteria through the inhibition of 
mismatch repair. 

The disclosures of the following references, as well as the references cited herein, are 
hereby incorporated by reference in their entirety. 
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What is claimed is: 

1. A method for generating antibiotic resistant bacteria comprising the steps of : 
blocking mismatch repair in a bacterium whereby said bacterium becomes 

hypermutable; 

contacting said bacterium with at least one antibiotic; 

selecting said a bacterium that is resistant to said antibiotic; and 

culturing said bacterium; 
thereby generating antibiotic resistant bacteria. 

2. The method of claim 1 wherein said mismatch repair is blocked by introducing a 
dominaBt negative allele of a mismatch repair gene into said bacteriimi. 

3 . The method of claim 2 wherein said dominant negative allele of a mismatch repair gene 

isaPMS'2-134gene. 

4. The method of claim 1 wherein said mismatch repair is blocked by introducing an 
antisense nucleic acid molecule into said bacterium wherein said antisense nucleic acid 
molecule specifically binds to a mismatch repair gene and inhibits mismatch repair in said 
bacterium. 

5. The method of claim 1 wherein said mismatch repair is blocked by exposing said 
bacterium to a compound that inhibits mismatch repair. 

6. The method of claim 5 wherein said compoimd is an anthracene derivative having the 
formula: 

wherein Ri-Rio are independently hydrogen, hydroxyl, amino, alkyl, substituted alkyl, 
alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, O-alkyl, S-alkyl, N-alkyl, 0-alkenyl, 
S-alkenyl, N-alkenyl,0-alkynyl, S-alkynyl, N-alkynyl, aryl, substituted aryl, aryloxy. 
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substituted aryloxy, heteroaryl, substituted heteroaryl, aralkyloxy, arylalkyl, alkylaryl, 
alkylaiyloxy, arylsulfonyl, alkylsulfonyl, alkoxycarbonyl, aryloxycarbonyl, guanidiao, 
carboxy, an alcohol, an amino acid, sulfonate, alkyl sulfonate, CN, NO2, an aldehyde group, 
an ester, an ether, a crown ether, a ketone, an organosulfur compound, an organometallic 
group, a carboxylic acid, an organosilicon or a carbohydrate that optionally contains one or 
more alkylated hydroxyl groups; 

wherein said heteroalkyl, heteroaryl, and substituted heteroaryl contain at least one 
heteroatom that is oxygen, sulfiir, a metal atom, phosphorus, silicon or nitrogen; 

wherein said substituents of said substituted alkyl, substituted alkenyl, substituted 
alkynyl, substituted aryl, and substituted heteroaryl are halogen, CN, NO2, lower alkyl, aryl, 
heterpaiyl, araUsyl, aralkyloxy, guanidino, alkoxycarbonyl, alkoxy, hydroxy, carboxy and 
amino; and 

wherein said amino groups optionally substituted with an acyl group, or 1 to 3 aryl or 
lower aUcyl groups. 

7. The method of claim 6 wherein said compound is selected from the group consisting of 
1,2-duuethylanthracene, 9,10-dimethyl anthracene, 7,8-dimethylanthracene, 9,10- 
diphenylantliracene, 9,10-dihydroxymethylanthracene, 9-hydroxymethyl-lO-methylanthracene, 
dimethylanthracene-l,2-diol, 9-hydroxymethyl-10-methylanthracene-l,2-diol, 9- 
hydroxymethyl- 1 0-methylanthracene-3 ,4-diol, 9 , 1 0-di-m-tolyanfhracene. 

8. The method of claim 6, further comprising exposing said bacterium to a chemical 
mutagen. 

9. The method of claim 8 wherein said chemical mutagen is selected from the group 
consisting of methane sulfonate, dimethyl sulfonate, 0-6-methyl benzadine, ethylnitrosourea, 
ethidium bromide, ethyl methanesulfonate, N-methyl-N'-nitro-N-nitrosoguanidine, 
methylniti-osourea, Tamoxifen, and 8-hydroxyguanine. 


10. 


The method of claim 5 wherein said compound is selected from the group consisting of 
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an ATP analog, a nuclease inhibitor, and a DNA polymerase inhibitor. 

1 1 . The method of claim 1 0 wherein said ATP analog is selected from the group consisting 
of AMP-PNP and ATP[gamma]S. 

12. The method of claim 10 wherein said nuclease inhibitor is selected from the group 
consisting of N-ethylmaleimide, heterodimeric adenine-chain-acridine compounds, 
exonulcease III inhibitors and heliquinomycin. 

1 3 . The method of claim 1 0 wherein said DNA polymerase inhibitor is selected from the 
group consisting of actinomycin D analogs, aphidicolin, l-(2'-Deoxy-2'-fluoro-beta-L- 
arabinofuranosyl)-5-methyluracil, and 2',3'-dideoxyribonucleoside 5'-triphosphates. 

14. The method of claim 1 wherein said antibiotic is a quinilone. 

15; The method of claim 1 wherein said antibiotic is an aminoglycoside. 

1 6; • The method of claim 1 wherein said antibiotic is a magainm. 

17. The method of claim 1 wherein said antibiotic is a defensin. 

1 8. The method of claim 1 wherein said antibiotic is a tetracycline. 

19. The method of claim 1 wherein said antibiotic is a beta-lactam. 

20. The method of claim 1 wherein said antibiotic is a macrolide. 

21 . The method of claim 1 wherein said antibiotic is a lincosamide. 

22. The method of claim 1 wherein said antibiotic is a sulfonamide. 
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23. The method of claim 1 wherein said antibiotic is a chloramphenicol, 

24. The method of claim 1 wherein said antibiotic is a nitrofurantoin. 

25 . The method of claim 1 wherein said antibiotic is an isoniazid. 

26. The method, of claim 1 wherein the step of determining whether said bacterium is 
resistant to said antibiotic comprises analyzing said bacterium for multiantiboitic resistance. 

27. The method of claim 1 further comprising making antibiotic resistant bacteria 
genetically stable. 

28. The method of claim 5 further comprising making antibiotic resistant bacteria 

genetically stable. 

29. The method of claim 28 wherein said antibiotic resistant bacteria are made genetically 
stable by removing the MMR inhibitory molecule. 

30. A method for identifymg a mutant gene conferring antibiotic resistance comprising 
comparing the genome of antibiotic resistant bacterium made by the method of claim 1 to the 
genome of a wild-type strain of said bacterium. 

3 1 . The method of claim 30 wherein the genome of said antibiotic resistant bacterium and 
the genome of said wild-type strain of said bacterium are compared by sequence analysis of the 
entire genomes. 

32. The method of claim 30 wherein the genome of said antibiotic resistant bacterium and 
the genome of said wild-type strain of said bacterium are compared by microarray analysis. 
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33 . The method of claim 30 wherein the genome of said antibiotic resistant bacterium and 
the genome of said wild-type strain of said bacterium are compared by: 

introducing gene fragments from said antibiotic resistant bacterium into flie wild-type 
bacterium, thereby producing mutant bacteria; 

selecting a mutant bacterium with antibiotic resistance; and 

sequencing said gene fragment ftom said mutant bacterium with antibiotic resistance, 
thereby identifying the antibiotic resistance gene. 

34. The method of claim 30 wherein the genome of said antibiotic resistant bacterium and 
the genome of said wild-tj^e bacterium are compared by: 

introducing gene fragments from said wild-tj^e strain of said bacterium into the 

antibiotic resistant strain of said bacterium; 

selecting a mutant bacterium with antibiotic resistance; and 

sequencing said gene fragment from said mutant bacterium, thereby identifying the 

antibiotic resistance gene. 

35. A method of producing an antibiotic resistant bacterium comprising the steps of: 
- culturing bacteria with a natural defect in mismatch repair; 

contacting said bacteria with at least one antibiotic; 
selecting a bacterium among said bacteria resistant to said antibiotic; and 
culturing said bacterium; 
thereby generating antibiotic resistant bacteria. 

36. A method of generating antibiotic resistant bacteria comprising the steps of: 
overexpressing a mismatch repair gene in a bacterium whereby said bacterium becomes 

hypermutable; 

contacting said bacterium with at least one antibiotic; 

determining whether said bacterium is resistant to said antibiotic; and 

culturing said bacterium; 
thereby generating antibiotic resistant bacteria. 
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37. The method of claim 3 6 fiirther comprising making said antibiotic resistant bacteria 
genetically stable. 

38. An antibiotic resistant bacterium produced by the method of claim 1 . 

39. An antibiotic resistant bacterium produced by the method of claim 35. 

40. An antibiotic resistant bacterium produced by the method of claim 36. 

41. An antibiotic resistant bacterium produced by the method of claim 37. 
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SEQUENCE LISTING 

<110=. Wicolaides, Nicholas C 
Sass, Philip M 
Grasso, Luigi M 
Kline, J Bradford 

<120> METHODS FOR GENERATING ANTIBIOTIC RESISTANT MICROBES AND NOVEL 
ANTIBIOTICS 

<130> MOR-0039 

<140> 00/000,000 
<141> 2001-07-25 

<160> 39 

<170> Patentln version 3.1 

<210> 1 

<211> 3218 

<212> DNA 

<213> Saccharomyces cerevisiae 


<400> 1 ^ ^ 

aaataggaat gtgatacctt ctattgcatg caaagatagt gtaggaggcg ctgctattgc 

caaagacttt tgagaccgct tgctgtttca ttatagttga ggagttctcg aagacgagaa 120 

attagcagtt ttcggtgttt agtaatcgcg ctagcatgct aggacaattt aactgcaaaa 180 

ttttgatacg atagtgatag taaatggaag gtaaaaataa catagaccta tcaataagca 

atgtctctca gaataaaagc acttgatgca tcagtggtta acaaaattgc tgcaggtgag 

atcataatat cccccgtaaa tgctctcaaa gaaatgatgg agaattccat cgatgcgaat 

gctacaatga ttgatattct agtcaaggaa ggaggaatta aggtacttca aataacagat 420 

aacggatctg gaattaataa agcagacctg ccaatcttat gtgagcgatt cacgacgtcc 480 

aaattacaaa aattcgaaga tttgagtcag attcaaacgt atggattccg aggagaagct 

ttagccagta tctcacatgt ggcaagagtc acagtaacga caaaagttaa agaagacaga 

tgtgcatgga gagtttcata tgcagaaggt aagatgttgg aaagccccaa acctgttgct 

ggaaaagacg gtaccacgat cctagttgaa gacctttttt tcaatattcc ttctagatta 

agggccttga ggtcccataa tgatgaatac tctaaaatat tagatgttgt cgggcgatac 780 

gccattcatt ccaaggacat tggcttttct tgtaaaaagt tcggagactc taattattct 

ttatcagtta aaccttcata tacagtccag gataggatta ggactgtgtt caataaatct 

gtggcttcga atttaattac ttttcatatc agcaaagtag aagatttaaa cctggaaagc 

gttgatggaa aggtgtgtaa tttgaatttc atatccaaaa agtccatttc attaattttt 

ttcattaata atagactagt gacatgtgat cttctaagaa gagctttgaa cagcgtttac 

tccaattatc tgccaaaggg cttcagacct tttatttatt tgggaattgt tatagatccg 

gcggctgttg atgttaacgt tcacccgaca aagagagagg ttcgtttcct gagccaagat 
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60 


240 
300 
360 


540 
600 
660 
720 


840 
900 
960 
1020 

1080 
1140 
1200 
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gagatcatag agaaaatcgc caatcaattg cacgccgaat tatctgccat tgatacttca 1260 

cgtactttca aggcttcttc aatttcaaca aacaagccag agtcattgat accatttaat 1320 

gacaccatag aaagtgatag gaataggaag agtctccgac aagcccaagt ggtagagaat 1380 

tcatatacga cagccaatag tcaactaagg aaagcgaaaa gacaagagaa taaactagtc 1440 

agaatagatg cttcacaagc taaaattacg tcatttttat cctcaagtca acagttcaac 1500 

tttgaaggat cgtctacaaa gcgacaactg agtgaaccca aggtaacaaa tgtaagccac 1560 

tcccaagagg cagaaaagct gacactaaat gaaagcgaac aaccgcgtga tgccaataca 1620 

atcaatgata atgacttgaa ggatcaacct aagaagaaac aaaagttggg ggattataaa 1680 

gttccaagca ttgccgatga cgaaaagaat gcactcccga tttcaaaaga cgggtatatt 1740 

agagtaccta aggagcgagt taatgttaat cttacgagta tcaagaaatt gcgtgaaaaa 180 0 

gtagatgatt cgatacatcg agaactaaca gacatttttg caaatttgaa ttacgttggg 1860 

gttgtagatg aggaaagaag attagccgct attcagcatg acttaaagct ttttttaata 192 0 

gattacggat ctgtgtgcta tgagctattc tatcagattg gtttgacaga cttcgcaaac 1980 

tttggtaaga taaacctaca gagtacaaat gtgtcagatg atatagtttt gtataatctc 2040 

ctatcagaat ttgacgagtt aaatgacgat gcttccaaag aaaaaataat tagtaaaata 2100 

tgggacatga gcagtatgct aaatgagtac tattccatag aattggtgaa tgatggtcta 2160 

gataatgact taaagtctgt gaagctaaaa tctctaccac tacttttaaa aggctacatt 2220 

ccatctctgg tcaagttacc attttttata tatcgcctgg gtaaagaagt tgattgggag 2280 

gatgaacaag agtgtctaga tggtatttta agagagattg cattactcta tatacctgat 2340 

atggttccga aagtcgatac actcgatgca tcgttgtcag aagacgaaaa agcccagttt 240 0 

ataaatagaa aggaacacat atcctcatta ctagaacacg ttctcttccc ttgtatcaaa 246 0 

cgaaggttcc tggcccctag acacattctc aaggatgtcg tggaaatagc caaccttcca 252 0 

gatctataca aagtttttga gaggtgttaa ctttaaaacg ttttggctgt aataccaaag 2580 

tttttgttta tttcctgagt gtgattgtgt ttcatttgaa agtgtatgcc ctttccfctta 2640 

acgattcatc cgcgagattt caaaggatat gaaatatggt tgcagttagg aaagtatgtc 2700 

agaaatgtat attcggattg aaactcttct aatagttctg aagtcacttg gttccgtatt 2760 

gttttcgtcc tcttcctcaa gcaacgattc ttgtctaagc ttattcaacg gtaccaaaga 2820 

cccgagtcct tttatgagag aaaacatttc atcatttttc aactcaatta tcttaatatc 2880 

attttgtagt attttgaaaa caggatggta aaacgaatca cctgaatcta gaagctgtac 2940 

cttgtcccat aaaagtttta atttactgag cctttcggtc aagtaaacta gtttatctag 3000 

ttttgaaccg aatattgtgg gcagatttgc agtaagttca gttagatcta ctaaaagttg 3 050 

tttgacag'ca gccgattcca caaaaatttg gtaaaaggag atgaaagaga cctcgcgcgt 3120 
Page 2 
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aatggtttgc atcaccatcg gatgtctgtt gaaaaactca ctttttgcat ggaagttatt 3180 
aacaataaga ctaatgatta ccttagaata atgtataa 3218 

<210> 2 
<211> 769 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 2 

Met Ser Leu Arg lie Lys Ala Leu Asp Ala Ser Val Val Asn Lys lie 
15 10 15 

Ala Ala Gly Glu He He He Ser Pro Val Asn Ala Leu Lys Glu Met 
20 25 30 

Met Glu Asn Ser He Asp Ala Asn Ala Thr Met He Asp He Leu Val 

35 40 45 

Lys Glu Gly Gly lie Lys Val Leu Gin He Thr Asp Asn Gly Ser Gly 
50 55 60 

He Asn Lys Ala Asp Leu Pro He Leu Cys Glu Arg Phe Thr Thr Ser 


Lys Leu Gin Lys Phe Glu Asp Leu Ser Gin He Gin Thr Tyr Gly Phe 
85 90 95 


Arg Gly Glu Ala Leu Ala Ser He Ser His Val Ala Arg Val Thr Val 
100 105 110 


Thr Thr Lys Val Lys Glu Asp Arg Cys Ala Trp Arg Val Ser Tyr Ala 
115 120 125 


Glu Gly Lys Met Leu Glu Ser Pro Lys Pro Val Ala Gly Lys Asp Gly 
130 135 140 


Thr Thr He Leu Val Glu Asp Leu Phe Phe Asn He Pro Ser Arg Leu 
145 150 155 160 


Arg Ala Leu Arg Ser His Asn Asp Glu Tyr Ser Lys He Leu Asp Val 
165 170 175 


Val Gly Arg Tyr Ala He His Ser Lys Asp He Gly Phe Ser Cys Lys 
180 185 190 


Lys Phe Gly Asp Ser Asn Tyr Ser Leu Ser Val Lys Pro Ser Tyr Thr 
195 200 205 
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Val Gin Asp Arg lie Arg Thr Val Phe Asn Lys Ser Val Ala Ser Asn 
210 215 220 


Leu lie Thr Phe His lie Ser Lys Val Glu Asp Leu Asn Leu Glu Ser 
225 230 235 240 


Val Asp Gly Lys Val Cys Asn Leu Asn Phe lie Ser Lys Lys Ser lie 
245 250 255 


Ser Leu lie Phe Phe lie Asn Asn Arg Leu Val Thr Cys Asp Leu Leu 
260 265 270 


Arg Arg Ala Leu Asn Ser Val Tyr Ser Asn Tyr Leu Pro Lys Gly Phe 
275 280 285 


Arg Pro Phe He Tyr Leu Gly He Val He Asp Pro Ala Ala Val Asp 
290 295 300 


Val Asn Val His Pro Thr Lys Arg Glu Val Arg Phe Leu Ser Gin Asp 
305 310 315 320 


Glu He He Glu Lys He Ala Asn Gin Leu His Ala Glu Leu Ser Ala 
325 330. 335 


He Asp Thr Ser Arg Thr Phe Lys Ala Ser Ser He Ser Thr Asn Lys 
340 345 350 


Pro Glu Ser Leu He Pro Phe Asn Asp Thr He Glu Ser Asp Arg Asn 
355 360 365 


Arg Lys Ser Leu Arg Gin Ala Gin Val Val Glu Asn Ser Tyr Thr Thr 
370 375 380 


Ala Asn Ser Gin Leu Arg Lys Ala Lys Arg Gin Glu Asn Lys Leu Val 
385 390 395 400 


Arg He Asp Ala Ser Gin Ala Lys He Thr Ser Phe Leu Ser Ser Ser 
405 410 415 


Gin Gin Phe Asn Phe Glu Gly Ser Ser Thr Lys Arg Gin Leu Ser Glu 
420 425 430 


Pro Lys Val Thr Asn Val Ser His Ser Gin Glu Ala Glu Lys Leu Thr 
435 440 . 445 


Leu Asn Glu Ser Glu Gin Pro Arg Asp Ala Asn Thr He Asn Asp Asn 
450 455 460 
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Asp Leu Lys Asp Gin Pro Lys Lys Lys Gin Lys Leu Gly Asp Tyr Lye 
465 470 475 480 

Val Pro Ser lie Ala Asp Asp Glu Lys Asn Ala Leu Pro lie Ser Lys 
485 490 495 

Asp Gly Tyr lie Arg Val Pro Lys Glu Arg Val Asn Val Asn Leu Thr 
500 505 510 

Ser He Lys Lys Leu Arg Glu Lys Val Asp Asp Ser He His Arg Glu 
515 520 525 

Leu Thr Asp He Phe Ala Asn Leu Asn Tyr Val Gly Val Val Asp Glu 

530 535 540 

Glu Arg Arg Leu Ala Ala He Gin His Asp Leu Lys Leu Phe Leu He 
545 550 555 560 

Asp Tyr Gly Ser Val Cys Tyr Glu Leu Phe Tyr Gin He Gly Leu Thr 
565 570 575 


Asp Phe Ala Asn Phe Gly Lys He Asn Leu Gin Ser Thr Asn Val Ser 
580 585 590 

Asp Asp He Val Leu Tyr Asn Leu Leu Ser Glu Phe Asp Glu Leu Asn 
595 SOO 605 

Asp Asp Ala Ser Lys Glu Lys He He Ser Lys He Trp Asp Met Ser 
610 615 620 

Ser Met Leu Asn Glu Tyr Tyr Ser He Glu Leu Val Asn Asp Gly Leu 
625 630 635 640 

Asp Asn Asp Leu Lys Ser Val Lys Leu Lys Ser Leu Pro Leu Leu Leu 
645 650 655 


Lys Gly Tyr He Pro Ser Leu Val Lys Leu Pro Phe Phe He Tyr Arg 

660 665 670 

Leu Gly Lys Glu Val Asp Trp Glu Asp Glu Gin Glu Cys Leu Asp Gly 
675 680 685 


He Leu Arg Glu He Ala Leu Leu Tyr He Pro Asp Met Val Pro Lys 
690 695 700 

Val Asp Thr Leu Asp Ala Ser Leu Ser Glu Asp Glu Lys Ala Gin Phe 
705 710 715 720 
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He Asn Arg Lys Glu His He Ser Ser Leu Leu Glu His Val Leu Phe 
725 730 735 

Pro Cys He Lys Arg Arg Phe Leu Ala Pro Arg His He Leu Lys Asp 
740 745 750 

Val Val Glu He Ala Asn Leu Pro Asp Leu Tyr Lye Val Phe Glu Arg 
755 760 765 

Cys 

<210> 3 

<211> 3056 

<212> DNA 

<213> Mus rausculus 

<400> 3 

gaattccggt gaaggtcctg aagaatttcc agattcctga gtatcattgg aggagacaga 60 

taacctgtcg tcaggtaacg atggtgtata tgcaacagaa atgggtgttc ctggagacgc 120 

gtcttttccc gagagcggca ccgcaactct cccgcggtga ctgtgactgg aggagtcctg 180 

catccatgga gcaaaccgaa ggcgtgagta cagaatgtgc taaggccatc aagcctatfcg 240 

atgggaagtc agtccatcaa atttgttctg ggcaggtgat actcagttta agcaccgctg 300 

tgaaggagtt gatagaaaat agtgtagatg ctggtgctac tactattgat ctaaggctta 360 

aagactatgg ggtggacctc attgaagttt cagacaatgg atgtggggta gaagaagaaa 420 

actttgaagg tctagctctg aaacatcaca catctaagat tcaagagttt gccgacctca 480 

cgcaggttga aactttcggc tttcgggggg aagctctgag ctctctgtgt gcactaagtg 540 

atgtcactat atctacctgc cacgggtctg caagcgttgg gactcgactg gtgtttgacc 600 

ataatgggaa aatcacccag aaaactccct acccccgacc taaaggaacc acagtcagtg 660 

tgcagcactt attttataca ctacccgtgc gttacaaaga gtttcagagg aacattaaaa 720 

aggagtattc caaaatggtg caggtcttac aggcgtactg tatcatctca gcaggcgtcc 780 

gtgtaagctg cactaatcag ctcggacagg ggaagcggca cgctgtggtg tgcacaagcg 840 

gcacgtctgg catgaaggaa aatatcgggt ctgtgtttgg ccagaagcag ttgcaaagcc 900 

tcattccttt tgttcagctg ccccctagtg acgctgtgtg tgaagagtac ggcctgagca 960 

cttcaggacg ccacaaaacc ttttctacgt ttcgggcttc atttcacagt gcacgcacgg 102 0 

cgccgggagg agtgcaacag acaggcagtt tttcttcatc aatcagaggc cctgtgaccc 1080 

agcaaaggtc tctaagcttg tcaatgaggt tttatcacat gtataaccgg catcagtacc 1140 

catttgtcgt ccttaacgtt tccgttgact cagaatgtgt ggatattaat gtaactccag 1200 

ataaaaggca aattctacta caagaagaga agctattgct ggccgtttta aagacctcct 1260 
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tgataggaat gtttgacagt gatgcaaaca agcttaatgt caaccagcag ccactgctag 
atgttgaagg taacttagta aagctgcata ctgcagaact agaaaagcct gtgccaggaa 
agcaagataa ctctccttca ctgaagagca cagcagacga gaaaagggta gcatccatct 
ccaggctgag agaggccttt tctcttcatc ctactaaaga gatcaagtct aggggtccag 
agactgctga actgacacgg agttttccaa gtgagaaaag gggcgtgtta tcctcttatc 
cttcagacgt catctcttac agaggcctcc gtggctcgca ggacaaattg gtgagtccca 
cggacagccc tggtgactgt atggacagag agaaaataga aaaagactca gggctcagca 
gcacatcagc tggctctgag gaagagttca gcaccccaga agtggccagt agctttagca 
gtgactataa cgtgagctcc ctagaagaca gaccttctca ggaaaccata aactgtggtg 
acctggactg ccgtcctcca ggtacaggac agtccttgaa gccagaagac catggatatc 
aatgcaaagc tctacctcta gctcgtctgt cacccacaaa tgccaagcgc ttcaagacag 
aggaaagacc ctcaaatgtc aacatttctc aaagattgcc tggtcctcag agcacctcag 
cagctgaggt cgatgtagcc ataaaaatga ataagagaat cgtgctcctc gagttctctc 
tgagttctct agctaagcga atgaagcagt tacagcacct aaaggcgcag aacaaacatg 
aactgagtta cagaaaattt agggccaaga tttgccctgg agaaaaccaa gcagcagaag 
atgaactcag aaaagagatt agtaaatcga tgtttgcaga gatggagatc ttgggtcagt 
ttaacctggg atttatagta accaaactga aagaggacct cttcctggtg gaccagcatg 
ctgcggatga gaagtacaac tttgagatgc tgcagcagca cacggtgctc caggcgcaga 
ggctcatcac accccagact ctgaacttaa ctgctgtcaa tgaagctgta ctgatagaaa 
atctggaaat attcagaaag aatggctttg actttgtcat tgatgaggat gctccagtca 
ctgaaagggc taaattgatt tccttaccaa ctagtaaaaa ctggaccttt ggaccccaag 
atatagatga actgatcttt atgttaagtg acagccctgg ggtcatgtgc cggccctcac 
gagtcagaca gatgtttgct tccagagcct gtcggaagtc agtgatgatt ggaacggcgc 
tcaatgcgag cgagatgaag aagctcatca cccacatggg tgagatggac cacccctgga 
actgccccca cggcaggcca accatgaggc acgttgccaa tctggatgtc atctctcaga 
actgacacac cccttgtagc atagagttta ttacagattg ttcggtttgc aaagagaagg 
ttttaagtaa tctgattatc gttgtacaaa aattagcatg ctgctttaat gtactggatc 
catttaaaag cagtgttaag gcaggcatga tggagtgttc ctctagctca gctacttggg 
tgatccggtg ggagctcatg tgagcccagg actttgagac cactccgagc cacattcatg 
agactcaatt caaggacaaa aaaaaaaaga tatttttgaa gccttttaaa aaaaaa 


1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3056 


<210> 4 
<211> 859 
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<40O> 4 

Met Glu Qln Thr Glu Gly Val Ser Thr Glu Cys Ala Lys Ala lie Lys 
15 10 15 


Pro lie Asp Gly Lys Ser Val His Gin lie Cys Ser Gly. Gin Val lie 


Leu Ser Leu Ser Thr Ala Val Lys Glu Leu lie Glu Asn Ser Val Asp 
35 40 45 


Ala Gly Ala Thr Thr He Asp Leu Arg Leu Lys Asp Tyr Gly Val Asp 


Leu He Glu Val Ser Asp Asn Gly Cys Gly Val Glu Glu Glu Asn Phe 


Glu Gly Leu Ala Leu Lys His His Thr Ser Lys He Gin Glu Phe Ala 
85 90 95 


lie He Ser Ala Gly Val Arg Val Ser Cys Thr Asn Gin Leu Gly Gin 

195 200 205 


Gly Lys Arg His Ala Val Val Cys Thr Ser Gly Thr Ser Gly Met Lys 
210 215 220 


Glu Asn He Gly Ser Val Phe Gly Gin Lys Gin Leu Gin Ser Leu He 
225 230 235 240 
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Pro Phe Val Gin Leu Pro Pro Ser Asp Ala Val Cys Glu Glu Tyr Gly 
245 250 255 


Leu Ser Thr Ser Gly Arg His Lys Thr Phe Ser Thr Phe Arg Ala Ser 
260 265 270 


Phe His Ser Ala Arg Thr Ala Pro Gly Gly Val Gin Gin Thr Gly Ser 
275 280 285 


Phe Ser Ser Ser He Arg Gly Pro Val Thr Gin Gin Arg Ser Leu Ser 
290 295 300 


Leu Ser Met Arg Phe Tyr His Met Tyr Asn Arg His Gin Tyr Pro Phe 

305 310 315 320 


Val Val Leu Asn Val Ser Val Asp Ser Glu Cys Val Asp He Asn Val 
325 330 335 


Ala Val Leu Lys Thr Ser Leu He Gly Met Phe Asp Ser Asp Ala Asn 
355 360 365 


Lys Leu Asn Val Asn Gin Gin Pro Leu Leu Asp Val Glu Gly Asn Leu 
370 375 380 


Val Lys Leu His Thr Ala Glu Leu Glu Lys Pro Val Pro Gly Lys Gin 
385 390 395 400 


Asp Asn Ser Pro Ser Leu Lys Ser Thr Ala Asp Glu Lys Arg Val Ala 
405 410 415 


Ser He Ser Arg Leu Arg Glu Ala Phe Ser Leu His Pro Thr Lys Glu 
420 425 430 


He Lys Ser Arg Gly Pro Glu Thr Ala Glu Leu Thr Arg Ser Phe Pro 
435 440 445 


Ser Glu Lys Arg Gly Val Leu Ser Ser Tyr Pro Ser Asp Val He Ser 
450 455 460 


Tyr Arg Gly Leu Arg Gly Ser Gin Asp Lys Leu Val Ser Pro Thr Asp 
465 470 475 480 


Ser Pro Gly Asp Cys Met Asp Arg Glu Lys He Glu Lys Asp Ser Gly 
485 490 495 
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Leu Ser Ser Thr Ser Ala 61y Ser Glu Glu Glu Phe Ser Thr Pro Glu 
500 505 510 


Arg Pro Ser Gin Glu Thr lie Asn Cys Gly Asp Leu Asp Cys Arg Pro 
530 535 540 


Pro Gly Thr Gly Gin Ser Leu Lys Pro Glu Asp His Gly Tyr Gin Cys 

545 550 555 560 


Lys Thr Glu Glu Arg Pro Ser Asn Val Asn lie Ser Gin Arg Leu Pro 
580 585 590 


Arg Met Lys Gin Leu Gin His Leu Lys Ala Gin Asn Lys His Glu Leu 
625 630 635 640 


Ser "Syr Arg Lys Phe Arg Ala Lys He Cys Pro Gly Glu Asn Gin Ala 

645 650 655 

Ala Glu Asp Glu Leu Arg Lys Glu He Ser Lys Ser Met Phe Ala Glu 

660 665 670 


Met Glu He Leu Gly Gin Phe Asn Leu Gly Phe He Val Thr Lys Leu 
675 680 685 


Lys Glu Asp Leu Phe Leu Val Asp Gin His Ala Ala Asp Glu Lys Tyr 
690 695 700 


Asn Phe Glu Met Leu Gin Gin His Thr Val Leu Gin Ala Gin Arg Leu 
705 710 715 720 


He Glu Asn Leu Glu He Phe Arg Lys Asn Gly Phe Asp Phe Val He 
740 745 750 
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Asp Glu Asp Ala Pro Val Thr Glu Arg Ala Lys Leu lie Ser Leu Pro 
755 760 765 

Thr Ser Lys Asn Trp Thr Phe Gly Pro Gin Asp lie Asp Glu Leu lie 
770 775 780 

Phe Met Leu Ser Asp Ser Pro Gly Val Met Cys Arg Pro Ser Arg Val 
785 790 795 800 

Arg Gin Met Phe Ala Ser Arg Ala Cys Arg Lys Ser Val Met lie Gly 
805 810 815 

Thr Ala Leu Asn Ala Ser Glu Met Lys Lys Leu lie Thr His Met Gly 

B20 825 830 

Glu Met Asp His Pro Trp Asn Cys Pro His Gly Arg Pro Thr Met Arg 
835 840 845 

His Val Ala Asn Leu Asp Val lie Ser Gin Asn 


<210> 5 

<211> 2771 

<212> DNA 

<213> Homo sapiens 

<400> 5 

cgaggcggat cgggtgttgc atccatggag cgagctgaga gctcgagtac agaacctgct 60 

aaggccatca aacctattga tcggaagtca gtccatcaga tttgctctgg gcaggtggta 120 

ctgagtctaa gcactgcggt aaaggagtta gtagaaaaca gtctggatgc tggtgccact 180 

aatattgatc taaagcttaa ggactatgga gtggatctta ttgaagtttc agacaatgga 24 0 

tgtggggtag aagaagaaaa cttcgaaggc ttaactctga aacatcacac atctaagatt 300 

caagagtttg ccgacctaac tcaggttgaa acttttggct ttcgggggga agctctgagc 360 

tcactttgtg cactgagcga tgtcaccatt tctacctgcc acgcatcggc gaaggttgga 420 

actcgactga tgtttgatca caatgggaaa attatccaga aaacccccta cccccgcccc 480 

agagggacca cagtcagcgt gcagcagtta ttttccacac tacctgtgcg ccataaggaa 540 

tttcaaagga atattaagaa ggagtatgcc aaaatggtcc aggtcttaca tgcatactgt 600 

atcatttcag caggcatccg tgtaagttgc accaatcagc ttggacaagg aaaacgacag 660 

cctgtggtat gcacaggtgg aagccccagc ataaaggaaa atatcggctc tgtgtttggg 720 

cagaagcagt tgcaaagcct cattcctttt gttcagctgc cccctagtga ctccgtgtgt 78 0 

gaagagtacg gtttgagctg ttcggatgct ctgcataatc ttttttacat ctcaggtttc 840 

atttcacaat gcacgcatgg agttggaagg agttcaacag acagacagtt tttctttato 900 
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aaccggcggc cttgtgaccc agcaaaggtc tgcagactcg tgaatgaggt ctaccacatg 960 

tataatcgac accagtatcc atttgttgtt cttaacattt ctgttgattc agaatgcgtt 1020 

gatatcaatg ttactccaga taaaaggcaa attttgctac aagaggaaaa gcttttgttg 1080 

gcagttttaa agacctcttt gataggaatg tttgatagtg atgtcaacaa gctaaatgtc 1140 

agtcagcagc cactgctgga tgttgaaggt aacttaataa aaatgcatgc agcggatttg 1200 

gaaaagccca tggtagaaaa gcaggatcaa tcaccttcat taaggactgg agaagaaaaa 1260 

aaagacgtgt ccatttccag actgcgagag gccttttctc ttcgtcacac aacagagaac 1320 

aagcctcaca gcccaaagac tccagaacca agaaggagcc ctctaggaca gaaaaggggt 138 0 

atgctgtctt ctagcacttc aggtgccatc tctgacaaag gcgtcctgag acctcagaaa 1440 

gaggcagtga gttccagtca cggacccagt gaccctacgg acagagcgga ggtggagaag 1500 

gactcggggc acggcagcac ttccgtggat tctgaggggt tcagcatccc agacacgggc 1560 

agtcactgca gcagcgagta tgcggccagc tccccagggg acaggggctc gcaggaacat 152 0 

gtggactctc aggagaaagc gcctgaaact gacgactctt tttcagatgt ggactgccat .1580 

tcaaaccagg aagataccgg atgtaaattt cgagttttgc ctcagccaac taatctcgca 1740 

accccaaaca caaagcgttt taaaaaagaa gaaattcttt ccagttctga catttgtcaa 1800 

aagttagtaa atactcagga catgtcagcc tctcaggttg atgtagctgt gaaaattaat 1860 

aagaaagttg tgcccctgga cttttctatg agttctttag ctaaacgaat aaagcagtta 1920 

catcatgaag cacagcaaag tgaaggggaa cagaattaca ggaagtttag ggcaaagatt 1980 

tgtcctggag aaaatcaagc agccgaagat gaactaagaa aagagataag taaaacgatg 2040 

tttgcagaaa tggaaatcat tggtcagttt aacctgggat ttataataac caaactgaat 2100 

gaggatatct tcatagtgga ccagcatgcc acggacgaga agtataactt cgagatgctg 21S0 

cagcagcaca ccgtgctcca ggggcagagg ctcatagcac ctcagactct caacttaact 2220 

gctgttaatg aagctgttct gatagaaaat ctggaaatat ttagaaagaa tggctttgat 22B0 

tttgttatcg atgaaaatgc tccagtcact gaaagggcta aactgatttc cttgccaact 2340 

agtaaaaact ggaccttcgg accccaggac gtcgatgaac tgatcttcat gctgagcgac 2400 

agccctgggg tcatgtgccg gccttcccga gtcaagcaga tgtttgcctc cagagcctgc 2460 

cggaagtcgg tgatgattgg gactgctctt aacacaagcg agatgaagaa actgatcacc 2520 

cacatggggg agatggacca cccctggaac tgtccccatg gaaggccaac catgagacac 2580 

atcgccaacc tgggtgtcat ttctcagaac tgaccgtagt cactgtatgg aataattggt 2S4 0 

tttatcgcag atttttatgt tttgaaagac agagtcttca ctaacctttt ttgttttaaa 2700 

atgaaacctg ctacttaaaa aaaatacaca tcacacccat ttaaaagtga tcttgagaac 2760 

cttttcaaac c 2771 
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<210> 6 

<211> 932 

<212> PRT 

<213> Homo sapiens 

<400> 6 

Met Lys Gin Leu Pro Ala Ala Thr Val Arg Leu Leu Ser Ser Ser Gin 
15 10 15 

He He Thr Ser Val Val Ser Val Val Lys Glu Leu He Glu Asn Ser 
20 25 30 

Leu Asp Ala Gly Ala Thr Ser Val Asp Val Lys Leu Glu Asn Tyr Gly 

35 40 45 

Phe Asp Lys lie Glu Val Arg Asp Asn Gly Glu Gly He Lys Ala Val 
50 55 60 

Asp Ala Pro Val Met Ala Met Lys Tyr Tyr Thr Ser Lys He Asn Ser 


His Glu Asp Leu Glu Asn Leu Thr Thr Tyr Gly Phe Arg Gly Glu Ala 
85 90 95 

Leu Gly Ser He Cys Cys He Ala Glu Val Leu He Thr Thr Arg Thr 
100 105 110 

Ala Ala Asp Asn Phe Ser Thr Gin Tyr Val Leu Asp Gly Ser Gly His 
115 120 125 

He Leu Ser Gin Lys Pro Ser His Leu Gly Gin Gly Thr Thr Val Thr 
130 135 140 


Ala Leu Arg Leu Phe Lys Asn Leu Pro Val Arg Lys Gin Phe Tyr Ser 
145 150 155 160 

Thr Ala Lys Lys Cys Lys Asp Glu He Lys Lys He Gin Asp Leu Leu 
165 170 175 


Met Ser Phe Gly He Leu Lys Pro Asp Leu Arg He Val Phe Val His 
180 185 190 


Asn Lys Ala Val He Trp Gin Lys Ser Arg Val Ser Asp His Lys Met 
195 200 205 


Ala Leu Met Ser Val Leu Gly Thr Ala Val Met Asn Asn Met Glu Ser 
210 215 220 
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Phe Gin Tyr His Ser Glu Glu Ser Gin lie Tyr Leu Ser Gly Phe Leu 
225 230 235 240 


Pro Lys Cys Asp Ala Asp His Ser Phe Thr Ser Leu Ser Thr Pro Glu 
245 250 255 


Arg Ser Phe lie Phe lie Asn Ser Arg Pro Val His Gin Lys Asp lie 
260 265 270 

Leu Lys Leu lie Arg His His Tyr Asn Leu Lys Cys Leu Lys Glu Ser 

275 280 285 


Thr Arg Leu Tyr Pro Val Phe Phe Leu Lys lie Asp Val Pro Thr Ala 
290 295 300 


Asp Val Asp Val Asn Leu Thr Pro Asp Lys Ser Gin Val Leu Leu Gin 
305 310 315 320 


Asn Lys Glu Ser Val Leu lie Ala Leu Glu Asn Leu Met Thr Thr Cys 
325 330 335 


Tyr Gly Pro Leu Pro Ser Thr Asn Ser Tyr Glu Asn Asn Lys Thr Asp 
340 345 350 


Val Ser Ala Ala Asp He Val Leu Ser Lys Thr Ala Glu Thr Asp Val 
355 360 365 


Leu Phe .^sn Lys Val Glu Ser Ser Gly Lys Asn Tyr Ser Asn Val Asp 
370 375 380 


Thr Ser Val He Pro Phe Gin Asn Asp Met His Asn Asp Glu Ser Gly 
385 390 395 400 


Lys Asn Thr Asp Asp Cys Leu Asn His Gin He Ser He Gly Asp Phe 
405 410 415 


Gly Tyr Gly His Cys Ser Ser Glu He Ser Asn He Asp Lys Asn Thr 
420 425 430 


Lys Asn Ala Phe Gin Asp He Ser Met Ser Asn Val Ser Trp Glu Asn 

435 440 445 


Ser Gin Thr Glu Tyr Ser Lys Thr Cys Phe He Ser Ser Val Lys His 
450 455 460 


Thr Gin Ser Glu Asn Gly Asn Lys Asp His He Asp Glu Ser Gly Glu 
465 470 475 480 
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Asn Glu Glu Glu Ala Gly Leu Glu Asn Ser Ser Glu lie Ser Ala Asp 
485 490 495 

Glu Trp Ser Arg Gly Asn lie Leu Lys Asn Ser Val Gly Glu Asn He 

500 505 510 


Glu Pro Val Lys He Leu Val Pro Glu Lys Ser Leu Pro Cys Lys Val 
515 520 525 

Ser Asn Asn Asn Tyr Pro He Pro Glu Gin Met Asn Leu Asn Glu Asp 
530 535 540 

Ser Cys Asn Lys Lys Ser Asn Val He Asp Asn Lys Ser Gly Lys Val 
545 550 555 560 


Thr Ala Tyr Asp Leu Leu Ser Asn Arg Val He Lys Lys Pro Met Ser 
565 570 575 

Ala Ser Ala Leu Phe Val Gin Asp His Arg Pro Gin Phe Leu 'He Glu 
580 585 590 

Asn Pro Lys Thr Ser Leu Glu Asp Ala Thr Leu Gin He Glu Glu Leu 
595 600 605 

Trp Lys Thr Leu Ser Glu Glu Glu Lys Leu Lys Tyr Glu' Glu Lys Ala 
610 615 620 

Thr Lys Asp Leu Glu Arg Tyr Asn Ser Gin Met Lys Arg Ala He Glu 
625 630 635 640 

Gin Glu Ser Gin Met Ser Leu Lys Asp Gly Arg Lys Lys He Lys Pro 
645 650 655 

Thr Ser Ala Trp Asn Leu Ala Gin Lys His Lys Leu Lys Thr Ser Leu 
660 665 570 

Ser Asn Gin Pro Lys Leu Asp Glu Leu Leu Gin Ser Gin He Glu Lys 
675 680 685 

Arg Arg Ser Gin Asn He Lys Met Val Gin He Pro Phe Ser Met Lys 

690 695 700 

Asn Leu Lys He Asn Phe Lys Lys Gin Asn Lys Val Asp Leu Glu Glu 
705 710 715 720 

Lys Asp Glu Pro Cys Leu He His Asn Leu Arg Phe Pro Asp Ala Trp 
725 730 735 
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Leu Met Thr Ser Lys Thr Glu Val Met Leu Leu Asn Pro Tyr Arg Val 
740 745 750 


Glu Glu Ala Leu Leu Phe Lys Arg Leu Leu, Glu Asn His Lys Leu Pro 
755 760 765 


Gly Ser His Tyr Leu Asp Val Leu Tyr Lys Met Thr Ala Asp Asp Gin 

785 790 795 800 


Leu Glu He Glu Gly Met Ala Asn Cys Leu Pro Phe Tyr Gly Val Ala 
835 840 845 


Asp Leu Lys Glu He Leu Asn Ala He Leu Asn Arg Asn Ala Lys Glu 
850 855 860 


Ala Val Arg Leu Ser Arg Gin Leu Pro Met Tyr Leu Ser Lys Glu Asp 
885 890 895 


Pro Glu Thr Thr 
930 

<210> 7 

<211> 3063 

<212> DNA 

<213> Homo sapiens 

<400> 7 

ggcacgagtg gctgottgcg gctagtggat ggtaattgcc tgcctcgcgc tagcagcaag 60 
ctgctctgtt aaaagcgaaa atgaaacaat tgcctgcggc aacagttcga ctcctttcaa 120 
gttctcagat catcacttcg gtggtcagtg ttgtaaaaga gcttattgaa aactccttgg 180 
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atgchggtgc cacaagcgta gatgttaaac tggagaacta tggatttgat aaaattgagg 24 0 

tgcgagataa cggggagggt atcaaggctg ttgatgcacc tgtaatggca atgaagtact 3 00 

acacctcaaa aataaatagt catgaagatc ttgaaaattt gacaacttac ggttttcgtg 360 

gagaagcctt ggggtcaatt tgttgtatag ctgaggtttt aattacaaca agaacggctg 420 

ctgataattt tagcacccag tatgttttag atggcagtgg ccacatactt tctcagaaac 480 

cttcacatct tggtcaaggt acaactgtaa ctgctttaag attatttaag aatctacctg 540 

taagaaagca gttttactca actgcaaaaa aatgtaaaga tgaaataaaa aagatccaag 600 

atctcctcat gagctttggt atccttaaac ctgacttaag gattgtcttt gtacataaca 660 

aggcagttat ttggcagaaa agcagagtat cagatcacaa gatggctctc atgtcagttc 720 

tgagsactgc tgttatgaac aatatggaat cctttcagta ccactctgaa gaatctcaga 780 

tttatctcag tggatttctt ccaaagtgtg atgcagacca ctctttcact agtctfctcaa 840 

caccagaaag aagtttcatc ttcataaaca gtcgaccagt acatcaaaaa gatatcttaa 900 

agttaatccg acatcattac aatctgaaat gcctaaagga atctactcgt ttgtatcctg 960 

ttttctttct gaaaatcgat gttcctacag ctgatgttga tgtaaattta acaccagata 1020 

aaagccaagt attattacaa aataaggaat ctgttttaat tgctcttgaa aatctgatga 1080 

cgacttgtta tggaccatta cctagtacaa attcttatga aaataataaa acagatgttt 1140 

ccgcagctga catcgttctt agtaaaacag cagaaacaga tgtgcttttt aataaagtgg 1200 

aatcatctgg aaagaattat tcaaatgttg atacttcagt cattccattc caaaatgata 1260 

tgcataatga tgaatctgga aaaaacactg atgattgttt aaatcaccag ataagtattg 1320 

gtgactttgg ttatggtcat tgtagtagtg aaatttctaa cattgataaa aacactaaga 1380 

atgcafcttca ggacatttca atgagtaatg tatcatggga gaactctcag acggaatata 1440 

gtaaaacttg ttttataagt tccgttaagc acacccagtc agaaaatggc aataaagacc 150 0 

atatagatga gagtggggaa aatgaggaag aagcaggtct tgaaaactct tcggaaattt 1560 

ctgcagatga gtggagcagg ggaaatatac ttaaaaattc agtgggagag aatattgaac 1620 

ctgtgaaaat tttagtgcct gaaaaaagtt taccatgtaa agtaagtaat aataattatc 1680 
caatccctga acaaatgaat cttaatgaag attcatgtaa caaaaaatca aatgtaatag , 1740 

ataataaatc tggaaaagtt acagcttatg atttacttag caatcgagta atcaagaaac 1800 

ccatgtcagc aagtgctctt tttgttcaag atcatcgtcc tcagtttctc atagaaaatc 1860 

ctaagactag tttagaggat gcaacactac aaattgaaga actgtggaag acattgagtg 1920 

aagaggaaaa actgaaatat gaagagaagg ctactaaaga cttggaacga tacaatagtc 1980 

aaatgaagag agccattgaa caggagtcac aaatgtcact aaaagatggc agaaaaaaga 2040 

taaaacccac cagcgcatgg aatttggccc agaagcacaa gttaaaaacc tcattatcta 2100 
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atcaaccaaa 

acttgatgaa ctccttcagt cccaaattga aaaaagaagg agtcaaaata 

2160 

ttaaaatggt 

acagatcGcc ttttctatga aaaacttaaa aataaatttt aagaaacaaa 

2220 

acaaagttga 

cttagaagag aaggatgaac cttgcttgat ccacaatctc aggtttcctg 

2280 

atgcatggct 

aatgacatcc aaaacagagg taatgttatt aaatccatat agagtagaag 

2340 

aagccctgct 

atttaaaaga cttcttgaga atcataaact tcctgcagag ccactggaaa 

2400 

agccaattat 

gttaacagag agtcttttta atggatctca ttatttagac gttttatata 

2460 

aaatgacagc 

agatgaccaa agatacagtg gatcaactta cctgtctgat cctcgtctta 

2520 

cagcgaatgg 

tttcaagata aaattgatac caggagtttc aattactgaa aattacttgg 

2580 

aaatagaagg 

aatggctaat tgtctcccat tctatggagt agcagattta aaagaaattc 

2640 

ttaatgctat 

attaaacaga aatgcaaagg aagtttatga atgtagacct cgcaaagtga 

2700 

taagttattt 

agagggagaa gcagtgcgtc tatccagaca attacccatg tacttatcaa 

27S0 

aagaggacat 

ccaagacatt atctacagaa tgaagcacca gtttggaaat gaaattaaag 

2820 

agtgtgttca 

tggtcgccca ttttttcatc atttaaccta tcttccagaa actacatgat 

2880 

taaatatgtt 

taagaagatt agttaccatt gaaattggtt ctgtcataaa acagcatgag 

2940 

tctggtttta 

aattatcttt gtattatgtg tcacatggtt attttttaaa tgaggattca 

3000 

ctgacttgtt 

tttatattga aaaaagttcc acgtattgta gaaaacgtaa ataaactaat 

3060 

aac 


3063 


<210> 8 

<211> 932 

<212> PRT 

<213> Homo sapiens 

<400> 8 

Met Lys Gin Leu Pro Ala Ala Thr Val Arg Leu Leu Ser Ser Ser Gin 
15 10 15 

lie lie Thr Ser Val Val Ser Val Val Lys Glu Leu He Glu Asn Ser 
20 25 30 

Leu Asp Ala Gly Ala Thr Ser Val Asp Val Lys Leu Glu Asn Tyr Gly 
35 40 45 

Phe Asp Lys He Glu Val Arg-Asp Asn Gly Glu Gly He Lys Ala Val 
50 55 60 

Asp Ala Pro Val Met Ala Met Lys Tyr Tyr Thr Ser Lys He Asn Ser 
65 70 75 80 


His Glu Asp Leu Glu Asn Leu Thr Thr Tyr Gly Phe Arg Gly Glu Ala 
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Ala Ala Asp Asn Phe Ser Thr Gin Tyr Val Leu Asp Gly Ser Gly His 
115 120 125 


Ala Leu Arg Leu Phe Lys Asn Leu Pro Val Arg Lye Gin Phe Tyr Ser 
145 150 155 160 


Met Ser Phe Gly lie Leu Lys Pro Asp Leu Arg He Val Phe Val His 
180 185 190 


Ala Leu Met Ser Val Leu Gly Thr Ala Val Met Asn Asn Met Glu Ser 
210 215 220 


Phe Gin Tyr His Ser Glu Glu Ser Gin He Tyr Leu Ser Gly Phe Leu 
225 230 235 240 


Pro Lys Cys Asp Ala Asp His Ser Phe Thr Ser Leu Ser Thr Pro Glu 
245 250 255 


Leu Lys Leu He Arg His His Tyr Asn Leu Lys Cys Leu Lys Glu Ser 
275 280 285 


Thr Arg Leu Tyr Pro Val Phe Phe Leu Lys He Asp Val Pro Thr Ala 
290 295 300 


Asp Val Asp Val Asn Leu Thr Pro Asp Lys Ser Gin Val Leu Leu Gin 
305 310 315 320 


Asn Lys Glu Ser Val Leu He Ala Leu Glu Asn Leu Met Thr Thr Cys 
325 330 335 


Tyr Gly Pro Leu Pro Ser Thr Asn Ser Tyr Glu Asn Asn Lys Thr Asp 
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Val Ser Ala Ala Asp He Val Leu Ser Lys Thr Ala Glu Thr Asp Val 
355 3S0 365 


Leu Phe Asn Lys Val Glu Ser Ser Gly Lys Asn Tyr Ser Asn Val Asp 
370 375 380 

Thr Ser Val He Pro Phe Gin Asn Asp Met His Asn Asp Glu Ser Gly 
385 390 395 400 


Lys Asn Thr Asp Asp Cys Leu Asn His Gin He Ser He Gly Asp Phe 
405 410 415 


Gly Tyr Gly His Cys Ser Ser Glu He Ser Asn He Asp Lys Asn Thr 
420 425 430 


Lys Asn Ala Phe Gin Asp He Ser Met Ser Asn Val Ser Trp Glu Asn 
435 440 445 


Ser Gin Thr Glu Tyr Ser Lys Thr Cys Phe He Ser Ser Val Lys His 
450 455 460 

Thr Gin Ser Glu Asn Gly Asn Lys Asp His He Asp Glu Ser Gly Glu 
465 470 475 480 

Asn Glu Glu Glu Ala Gly Leu Glu Asn Ser Ser Glu He Ser Ala Asp 
485 490 495 


Glu Trp Ser Arg Gly Asn He Leu Lys Asn Ser Val Gly Glu Asn He 
500 505 510 


Glu Pro Val Lys He Leu Val Pro Glu Lys Ser Leu Pro Cys Lys Val 
515 520 525 


Ser Asn Asn Asn Tyr Pro He Pro Glu Gin Met Asn Leu Asn Glu Asp 
530 535 540 


Ser Cys Asn Lys Lys Ser Asn Val He Asp Asn Lys Ser Gly Lys Val 
545 550 555 560 


Thr Ala Tyr Asp Leu Leu Ser Asn Arg Val He Lys Lys Pro Met Ser 
565 570 575 


Ala Ser Ala Leu Phe Val Gin Asp His Arg Pro Gin Phe Leu He Glu 
580 585 590 


Asn Pro Lys Thr Ser Leu Glu Asp Ala Thr Leu Gin He Glu Glu Leu 
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Trp Lys Thx Leu Ser Glu Glu Glu Lys Leu Lys Tyr Glu Glu Lys Ala 
510 615 620 


Thr Lys Asp Leu Glu Arg Tyr Aan Ser Gin Met Lys Arg Ala He Glu 
625 630 635 640 


Gin Glu Ser Gin Met Ser Leu Lys Asp Gly Arg Lys Lys He Lys Pro 
645 650 655 


Thr Ser Ala Trp Asn Leu Ala Gin Lys His Lys Leu Lys Thr Ser Leu 
660 665 670 ■ 


Ser Asn Gin Pro Lys Leu Asp Glu Leu Leu Gin Ser Gin lie Glu Lys 
675 680 685 


Arg Arg Ser Gin Asn He Lys Met Val Gin He Pro Phe Ser Met Lys 
690 695 700 


Asn Leu Lys He Asn Phe Lys Lys Gin Asn Lys Val Asp Leu Glu Glu 
705 710 715 720 

Lys Asp Glu Pro Cys Leu He His Asn Leu Arg Phe Pro Asp Ala Trp 
725 730 735 


Leu Met Thr Ser Lys Thr Glu Val Met Leu Leu Asn Pro Tyr Arg Val 
740 745 750 


Glu Glu Ala Leu Leu Phe Lys Arg Leu Leu Glu Asn His Lys Leu Pro 
755 760 765 


Ala Glu Pro Leu Glu Lys Pro He Met Leu Thr Glu Ser Leu Phe Asn 
770 775 780 


Gly Ser His Tyr Leu Asp Val Leu Tyr Lys Met Thr Ala Asp Asp Gin 
785 790 795 800 


Arg Tyr Ser Gly Ser Thr Tyr Leu Ser Asp Pro Arg Leu Thr Ala Asn 
805 810 815 


Gly Phe Lys He Lys Leu He Pro Gly Val Ser He Thr Glu Asn Tyr 
820 825 830 


Leu Glu He Glu Gly Met Ala Asn Cys Leu Pro Phe Tyr Gly Val Ala 
835 840 845 


Asp Leu Lys Glu He Leu Asn Ala He Leu Asn Arg Asn Ala Lys Glu 
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850 855 ■ 860 

Val Tyr Glu Cys Arg Pro Arg Lys Val He Ser Tyr Leu Glu Gly Glu 
8S5 870 875 880 

Ala Val Arg Leu Ser Arg Gin Leu Pro Met Tyr Leu Ser Lys Glu Asp 
885 890 895 

He Gin Asp He He Tyr Arg Met Lys His Gin Phe Gly Asn Glu He 
900 905 910 

Lys Glu Cys Val His Gly Arg Pro Phe Phe His His Leu Thr Tyr Leu 
915 920 925 

Pro Glu Thr Thr 
930 

<210> 9 

<211> 3145 

<212> DNA 

<213> Homo sapiens 


<400> 9 


ggcgggaaac 

agcttagtgg 

gtgtggggtc 

gcgcattttc ttcaaccagg 

aacrtcracrcracr 


gtttcgacat 

ggcggtgcag 

ccgaaggaga 

cgctgcagtt ggagagcgcg 

gccgaggtcg 

120 

gcttcgtgcg 

cttGtttcag 

ggcatgccgg 

agaagccgac caccacagtg 

cgccttttcg 

180 

accggggcga 

cttctatacg 

gcgcacggcg 

aggacgcgct gctggccgcc 

cgggaggtgt 

240 

tcaagaccca 

gggggtgatc 

aagtacatgg 

ggccggcagg agcaaagaat 

ctgcagagtg 

300 

ttgtgcttag 

taaaatgaat 

tttgaatctt 

ttgtaaaaga tcttcttctg 

gttcgtcagt 

360 

atagagttga 

agtttataag 

aatagagctg 

gaaataaggc atccaaggag 

aatgattggt 

420 

atttggcata 

taaggcttct 

cctggcaatc 

tctctcagtt tgaagacatt 

ctctttggta 

480 

acaatgatat 

gtcagcttcc 

attggtgttg 

tgggtgttaa aatgtccgca 

gttgatggcc 

540 

agagacaggt 

tggagttggg 

tatgtggatt 

ccatacagag gaaactagga 

ctgtgtgaat 

600 

tccctgataa 

tgatcagttc 

tccaatcttg 

aggctctcct catccagatt 

ggaccaaagg 

660 

aatgtgtttt 

acccggagga 

gagactgctg 

gagacatggg gaaactgaga 

cagataattc 

720 

aaagaggagg 

aattctgatc 

acagaaagaa 

aaaaagctga cttttccaca 

aaagacattt 

780 

atcaggacct 

caaccggttg 

ttgaaaggca 

aaaagggaga gcagatgaat 

agtgctgtat 

840 

tgccagaaat 

ggagaatcag 

gttgcagttt 

catcactgtc tgcggtaatc 

aagtttttag 

900 

aactcttatc 

agatgattcc 

aactttggac 

agtttgaact gactactttt 

gacttcagcc 

960 

agtatatgaa 

attggatatt 

gcagcagtca 

gagcccttaa cctttttcag 

ggttctgttg 

1020 

aagataccac 

tggctctcag 

tctctggctg 

ccttgctgaa taagtgtaaa 
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gacaaagact tgttaaccag tggattaagc agcctctcat ggataagaac agaatagagg 1140 

agagattgaa tttagtggaa gcttttgtag aagatgcaga attgaggcag actttacaag 12 00 

aagatttact tcgtcgattc ccagatctta accgacttgc caagaagttt caaagacaag 1250 

cagcaaactt acaagattgt taccgactct atcagggtat aaatcaacta cctaatgtta 1320 

tacaggctct ggaaaaacat gaaggaaaac accagaaatt attgttggca gtttttgtga 1380 

ctcctcttac tgatcttcgt tctgacttct ccaagtttca ggaaatgata gaaacaactt 1440 

tagatatgga tcaggtggaa aaccatgaat tccttgtaaa accttcattt gatcctaatc 1500 

tcagtgaatt aagagaaata atgaatgact tggaaaagaa gatgcagtca acattaataa 1560 

gtgcagccag agatcttggc ttggaccctg gcaaacagat taaactggat tccagtgcac 1620 

agtttggata ttactttcgt gtaacctgta aggaagaaaa agtccttcgt aacaataaaa 1680 

actttagtac tgtagatatc cagaagaatg gtgttaaatt taccaacagc aaattgactt 1740 

ctttaaatga agagtatacc aaaaataaaa cagaatatga agaagcccag gatgccattg 1800 

ttaaagaaat tgtcaatatt tcttcaggct atgtagaacc aatgcagaca ctcaatgatg 1860 

tgttagctca gctagatgct gttgtcagct ttgctcacgt gtcaaatgga gcacctgttc 1920 

catatgtacg accagccatt ttggagaaag gacaaggaag aattatatta aaagcatcca 1980 

ggcatgcttg tgttgaagtt caagatgaaa ttgcatttat tcctaatgac gtatactttg 2040 

aaaaagataa acagatgttc cacatcatta ctggccccaa tatgggaggt aaatcaacat 2100 

atattcgaca aactggggtg atagtactca tggcccaaat tgggtgtttt gtgccatgtg 2160 

agtcagcaga agtgtccatt gtggactgca tcttagcccg agtaggggct ggtgacagtc 2220 

aattgaaagg agtctccacg ttcatggctg aaatgttgga aactgcttct atcctcaggt 2280 

ctgcaaccaa agattcatta ataatcatag atgaattggg aagaggaact tctacctacg 2340 

atggatttgg gttagcatgg gctatatcag aatacattgc aacaaagatt ggtgcttttt 2400 

gcatgtttgc aacccatttt catgaactta ctgccttggc caatcagata ccaactgtta 2460 

ataatctaca tgtcacagca ctcaccactg aagagacctt aactatgctt tatcaggtga 2520 

agaaaggtgt ctgtgatcaa agttttggga ttcatgttgc agagcttgct aatttcccta 2580 

agcatgtaat agagtgtgct aaacagaaag ccctggaact tgaggagttt cagtatattg 2640 

gagaatcgca aggatatgat atcatggaac cagcagcaaa gaagtgctat ctggaaagag 2700 

agcaaggtga aaaaattatt caggagttcc tgtccaaggt gaaacaaatg ccctttactg 2760 

aaatgtcaga agaaaacatc acaataaagt taaaacagct aaaagctgaa gtaatagcaa 2820 
agaataatag ctttgtaaat gaaatcattt cacgaataaa agttactacg tgaaaaatcc 28 80 
cagtaatgga atgaaggtaa tattgataag ctattgtctg taatag-tttt atattgtttt 2940 
atattaaccc tttttccata gtgttaactg tcagtgccca tgggctatca acttaataag 3000 
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atatttagta atattttact ttgaggacat tttcaaagat ttttattttg aaaaatgaga 3060 
gctgtaactg aggactgttt gcaattgaca taggcaataa taagtgatgt gctgaatttt 3120 
ataaataaaa tcatgtagtt tgtgg 3145 

<210> 10 

<211> 934 

<212> PRT 

<213> Homo sapiens 

<400> 10 

Met Ala Val Gin Pro Lys Glu Thr Leu Gin Leu Glu Ser Ala Ala Glu 
1 ■ 5 10 15 

Val Gly Phe Val Arg Phe Phe Gin Gly Met Pro Glu Lya Pro Thr Thr 


Thr Val Arg Leu Phe Asp Arg Gly Asp Phe Tyr Thr Ala Hia Gly Glu 
35 40 45 


Asp Ala Leu Leu Ala Ala Arg Glu Val Phe Lys Thr Gin Gly Val lie 
50 55 60 

Lys Tyr Met Gly Pro Ala Gly Ala Lys Asn Leu Gin Ser Val Val Leu 


Ser Lys Met Asn Phe Glu Ser Phe Val Lys Asp Leu Leu Leu Val Arg 
85 90 95 


Gin Tyr Arg Val Glu Val Tyr Lys Asn Arg Ala Gly Asn Lys Ala Ser 
100 105 110 


Lys Glu Asn Asp Trp Tyr Leu Ala Tyr Lys Ala Ser Pro Gly Asn Leu 
115 120 125 


Ser Gin Phe Glu Asp lie Leu Phe Gly Asn Asn Asp Met Ser Ala Ser 
130 135 140 


lie Gly Val Val Gly Val Lys Met Ser Ala Val Asp Gly Gin Arg Gin 
145 150 155 160 


Val Gly Val Gly Tyr Val Asp Ser lie Gin Arg Lys Leu Gly Leu Cys 
165 170 175 


Glu Phe Pro Asp Asn Asp Gin Phe Ser Asn Leu Glu Ala Leu Leu lie 
180 185 190 


Gin He Gly Pro Lys Glu Cys Val Leu Pro Gly Gly Glu Thr Ala Gly 
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Asp Met Gly Lys Leu Arg Gin lie lie Gin Arg Gly Gly He Leu He 
210 215 220 


Thr Glu Arg Lys Lys Ala Asp Phe Ser Thr Lys Aap He Tyr Gin Asp 
225 230 235 240 

Leu Asn Arg Leu Leu Lys Gly Lys Lys Gly Glu Gin Met Asn Ser Ala 
245 250 255 


Val Leu Pro Glu Met Glu Asn Gin Val Ala Val Ser Ser Leu Ser Ala 
260 265 270 

Val He Lys Phe Leu Glu Leu Leu Ser Asp Asp Ser Asn Phe Gly Gin 
275 280 285 


Phe Glu Leu Thr Thr Phe Asp Phe Ser Gin Tyr Met Lys Leu Asp He 
290 295 300 

Ala Ala Val Arg Ala Leu Asn Leu Phe Gin Gly Ser Val Glu Asp Thr 
305 310 315 320 

Thr Gly Ser Gin Ser Leu Ala Ala Leu Leu Asn Lys Cys Lys Thr Pro 
325 330 335 


Gin Gly Gin Arg Leu Val Asn Gin Trp He Lys Gin Pro Leu Met Asp 
340 345 350 

Lys Asn Arg He Glu Glu Arg Leu Asn Leu Val Glu Ala Phe Val Glu 
355 360 365 

Asp Ala Glu Leu Arg Gin Thr Leu Gin Glu Asp Leu Leu Arg Arg Phe 
370 375 380 

Pro Asp Leu Asn Arg Leu Ala Lys Lys Phe Gin Arg Gin Ala Ala Asn 
385 390 395 400 

Leu Gin Asp Cys Tyr Arg Leu Tyr Gin Gly He Asn Gin Leu Pro Asn 
405 410 415 


Val He Gin Ala Leu Glu Lys His Glu Gly Lys His Gin Lys Leu Leu 
420 425 430 


Leu Ala Val Phe Val Thr Pro Leu Thr Asp Leu Arg Ser Asp Phe Ser 
435 440 445 


Lys Phe Gin Glu Met He Glu Thr Thr Leu Asp Met Asp Gin Val Glu 
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Asn His Glu Phe Leu Val Lys Pro Ser Phe Asp Pro Asn Leu Ser Glu 
465 470 475 480 


Leu Arg Glu lie Met Asn Asp Leu Glu Lys Lys Met Gin Ser Thr Leu 
485 490 495 


lie Ser Ala Ala Arg Asp Leu Gly Leu Asp Pro Gly Lys Gin lie Lys 
500 505 510 


Leu Asp Ser Ser Ala Gin Phe Gly Tyr Tyr Phe Arg Val Thr Cys Lys 
515 520 525 


Glu Glu Lys Val Leu Arg Asn Asn Lys Asn Phe Ser Thr Val Asp lie 
530 535 540 


Gin Lys Asn Gly Val Lys Phe Thr Asn Ser Lys Leu Thr Ser Leu Asn 
545 550 555 560 

Glu Glu Tyr Thr Lys Asn Lya Thr Glu Tyr Glu Glu Ala Gin Asp Ala 
565 570 575 


He Val Lys Glu He Val Asn He Ser Ser Gly Tyr Val Glu Pro Met 
580 585 590 


Gin Thr Leu Asn Asp Val Leu Ala Gin Leu Asp Ala Val Val Ser Phe 
595 600 605 


Ala His Val Ser Asn Gly Ala Pro Val Pro Tyr Val Arg Pro Ala lie 
610 615 620 

Leu Glu Lys Gly Gin Gly Arg He He Leu Lys Ala Ser Arg His Ala 
625 630 635 640 


Cys Val Glu Val Gin Asp Glu He Ala Phe He Pro Asn Asp Val Tyr 
645 650 655 


Phe Glu Lys Asp Lys Gin Met Phe His He He Thr Gly Pro Asn Met 
660 665 670 


Gly Gly Lys Ser Thr Tyr He Arg Gin Thr Gly Val He Val Leu Met 
675 680 685 


Ala Gin He Gly Cys Phe Val Pro Cys Glu Ser Ala Glu Val Ser He 
690 695 700 


Val Asp Cys He Leu Ala Arg Val Gly Ala Gly Asp Ser Gin Leu Lys 
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Gly Val Ser Thr Phe Met Ala Glu Met Leu Glu Thr Ala Ser lie Leu 
725 730 735 


Arg Ser Ala Thr Lys Asp Ser Leu lie lie lie Asp Glu Leu Gly Arg 
740 745 750 

Gly Thr Ser Thr Tyr Asp Gly Phe Gly Leu Ala Trp Ala lie Ser Glu 
755 760 765 


Tyr lie Ala Thr Lys lie Gly Ala Phe Cys Met Phe Ala Thr His Phe 
770 775 780 


His Glu Leu Thr Ala Leu Ala Asn Gin He Pro Thr Val Asn Asn Leu 
785 790 795 800 

His Val Thr Ala Leu Thr Thr Glu Glu Thr Leu Thr Met Leu Tyr Gin 
805 810 815 


Val Lys Lys Gly Val Cys Asp Gin Ser Phe Gly He His Val Ala Glu 
820 825 830 

Leu Ala Asn Phe Pro Lys His Val He Glu Cys Ala Lys Gin Lys Ala 
835 840 845 

Leu Glu Leu Glu Glu Phe Gin Tyr He Gly Glu Ser Gin Gly Tyr Asp 
850 855 860 

He Met Glu Pro Ala Ala Lys Lys Cys Tyr Leu Glu Arg Glu Gin Gly 
865 870 875 880 

Glu Lys He He Gin Glu Phe Leu Ser Lys Val Lys Gin Met Pro Phe 
885 890 895 


Thr Glu Met Ser Glu Glu Asn He Thr He Lys Leu Lys Gin Leu Lys 
900 905 910 

Ala Glu Val He Ala Lys Asn Asn Ser Phe Val Asn Glu He He Ser 
915 920 925 


Arg He Lys Val Thr Thr 
930 


<210> 11 

<211> 2484 

<212> DNA 

<213i Homo sapiens 
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<400> 11 

cttggctctt ctggcgccaa aatgtcgttc gtggcagggg ttattcggcg gctggacgag 60 

acagtggtga accgcatcgc ggcgggggaa gttatccagc ggccagctaa tgctatcaaa 12 0 

gagatgattg agaactgttt agatgcaaaa tccacaagta ttcaagtgat tgttaaagag 180 

ggaggcctga agttgattca gatccaagac aatggcaccg ggatcaggaa agaagatctg 240 

gatattgtat gtgaaaggtt cactactagt aaactgcagt cctttgagga tttagccagt 300 

atttctacct atggctttcg aggtgaggct ttggccagca taagccatgt ggctcatgtt 360 

actattacaa cgaaaacagc tgatggaaag tgtgcataca gagcaagtta ctcagatgga 420 

aaactgaaag cccctcctaa accatgtgct ggcaatcaag ggacccagat cacggtggag 480 

gacctttttt acaacatagc cacgaggaga aaagctttaa aaaatccaag tgaagaatat 540 

gggaaaattt tggaagttgt tggcaggtat tcagtacaca atgcaggcat tagtttctca 600 

gttaaaaaac aaggagagac agtagctgat gttaggacac tacccaatgc ctcaaccgtg 660 

gacaatattc gctccatctt tggaaatgct gttagtcgag aactgataga aattggatgt 720 

gaggataaaa ccctagcctt caaaatgaat ggttacatat ccaatgcaaa ctactcagtg 780 

aagaagtgca tcttcttact cttcatcaac catcgtctgg tagaatcaac ttccttgaga 840 

aaagccatag aaacagtgta tgcagcctat ttgcccaaaa acacacaccc attcctgtac 900 

ctcagtttag aaatcagtcc ccagaatgtg gatgttaatg tgcaccccac aaagcatgaa 960 

gttcacttcc tgcacgagga gagcatcctg gagcgggtgc agcagcacat cgagagcaag 1020 

ctcctgggct ccaattcctc caggatgtac ttcacccaga ctttgctacc aggacttgct 1080 

ggcccctctg gggagatggt taaatccaca acaagtctga cctcgtcttc tacttctgga 1140 

agtagtgata aggtctatgc ccaccagatg gttcgtacag attcccggga acagaagctt 1200 

gatgcatttc tgcagcctct gagcaaaccc ctgtccagtc agccccaggc cattgtcaca 1260 

gaggataaga cagatatttc tagtggcagg gctaggcagc aagatgagga gatgcttgaa 1320 

ctcccagccG ctgctgaagt ggctgccaaa aatcagagct tggaggggga tacaacaaag 1380 

gggacttcag aaatgtcaga gaagagagga cctacttcca gcaaccccag aaagagacat 1440 

cgggaagatt ctgatgtgga aatggtggaa gatgattccc gaaaggaaab gactgcagct 1500 

tgtaccGccc ggagaaggat cattaacctc actagtgttt tgagtctcca ggaagaaatt 1560 

aatgagcagg gacatgaggt .tctccgggag atgttgcata accactcctt cgtgggctgt 1620 

gtgaatcctc agtgggcctt ggcacagcat caaaccaagt tataccttct caacaccacc 1680 

aagcttagtg aagaactgtt ctaccagata ctcatttatg attttgccaa ttttggtgtt 1740 

ctcaggttat cggagccagc accgctcttt gaccttgcca tgcttgcctt agatagtcca 1800 

gagagtggct ggacagagga agatggtccc aaagaaggac ttgctgaata cattgttgag 1860 

tttctgaaga agaaggctga gatgcttgaa gactatttct ctttggaaat tgatgaggaa 1920 
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gggaacctga ttggattacc ccttctgatt gacaactatg tgcccccttt ggagggactg 1980 

cctatcttca ttcttcgact agccactgag gtgaattggg acgaagaaaa ggaatgtttt 2040 

gaaagcctca gtaaagaatg cgctatgttc tattccatcc ggaagcagta catatctgag 2100 

gagtcgaccc tctcaggcca gcagagtgaa gtgcctggct ccattccaaa ctcctggaag 2160 

tggactgtgg aacacattgt ctataaagcc ttgcgctcac acattctgcc tcctaaacat 2220 

ttcacagaag atggaaatat cctgcagctt gctaacctgc ctgatctata caaagtcttt 2280 

gagaggtgtt aaatatggtt atttatgcac tgtgggatgt gttcttcttt ctctgtattc 2340 

cgatacaaag tgttgtatca aagtgtgata tacaaagtgt accaacataa gtgttggtag 2400 

cacttaagac ttatacttgc cttctgatag tattccttta tacacagtgg attgattata 2460 

aataaataga tgtgtcttaa cata 2484 

<210> 12 

<211> 756 

<212> PRT 

<213> Homo sapiens 

<400> 12 

Met Ser Phe Val Ala Gly Val lie Arg Arg Leu Asp Glu Thr Val Val 
15 10 15 

Asn Arg He Ala Ala Gly Glu Val He Gin Arg Pro Ala Asn Ala He 
20 25 30 

Lys Glu Met He Glu Asn Cys Leu Asp Ala Lys Ser Thr Ser He Gin 
35 40 45 

Val He Val Lys Glu Gly Gly Leu Lys Leu He Gin He Gin Asp Asn 
50 55 60 

Gly Thr Gly He Arg Lys Glu Asp Leu Asp He 'Val Cys Glu Arg Phe 


Thr Thr Ser Lys Leu Gin Ser Phe Glu Asp Leu Ala Ser He Ser Thr 
85 90 95 


Tyr Gly Phe Arg Gly Glu Ala Leu Ala Ser He Ser His Val Ala His 
100 105 110 


Val Thr He Thr Thr Lys Thr Ala Asp Gly Lys Cys Ala Tyr Arg Ala 
115 120 125 


Ser Tyr Ser Asp Gly Lys Leu Lys Ala Pro Pro Lys Pro Cys Ala Gly 
130 135 140 
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Asn Gin Gly Thr Gin lie Thr Val Glu Asp Leu Phe Tyr Asn lie Ala 
145 150 155 160 


Thr Arg Arg Lys Ala Leu Lys Asn Pro Ser Glu Glu Tyr Gly Lys lie 
165 170 175 


Leu Glu Val Val Gly Arg Tyr Ser Val His Asn Ala Gly He Ser Phe 
180 185 190 


Ser Val Lys Lys Gin Gly Glu Thr Val Ala Asp Val Arg Thr Leu Pro 
195 200 205 


Asn Ala Ser Thr Val Asp Asn He Arg Ser He Phe Gly Asn Ala Val 

210 215 220 


Ser Arg Glu Leu lie Glu He Gly Cys Glu Asp Lys Thr Leu Ala Phe 
225 230 235 240 


He Phe Leu Leu Phe He Asn His Arg Leu Val Glu Ser Thr Ser Leu 
260 265 270 


Arg Lys Ala lie Glu Thr Val Tyr Ala Ala Tyr Leu Pro Lys Asn Thr 
275 280 285 


His Pro Phe Leu Tyr Leu Ser Leu Glu He Ser Pro Gin Asn Val Asp 
290 295 300 


Val Asn Val His Pro Thr Lys His Glu Val His Phe Leu His Glu Glu 
305 310 315 320 


Ser He Leu Glu Arg Val Gin Gin His He Glu Ser Lys Leu Leu Gly 
325 330 335 


Ser Asn Ser Ser Arg Met Tyr Phe Thr Gin Thr Leu Leu Pro Gly Leu 
340 345 350 


Ala Gly Pro Ser Gly Glu Met Val Lys Ser Thr Thr Ser Leu Thr Ser 
355 360 365 


Ser Ser Thr Ser Gly Ser Ser Asp Lys Val Tyr Ala His Gin Met Val 
370 375 380 


Arg Thr Asp Ser Arg Glu Gin Lys Leu Asp Ala Phe Leu Gin Pro Leu 
385 390 395 400 
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Ser Lys Pro Leu Ser Ser Gin Pro Gin Ala He Val Thr Glu Asp Lys 
405 410 415 

Thr Asp lie Ser Ser Gly Arg Ala Arg Gin Gin Asp Glu Glu Met Leu 
420 425 430 

Glu Leu Pro Ala Pro Ala Glu Val Ala Ala Lys Asn Gin Ser Leu Glu 
435 440 445 

Gly Asp Thr Thr Lys Gly Thr Ser Glu Met Ser Glu Lys Arg Gly Pro 
450 455 460 

Thr Ser Ser Asn Pro Arg Lys Arg His Arg Glu Asp Ser Asp Val Glu 
465 470 475 480 

Met Val Glu Asp Asp Ser Arg Lys Glu Met Thr Ala Ala Cys Thr Pro 
485 490 495 

Arg Arg Arg He He Asn Leu Thr Ser Val Leu Ser Leu Gin Glu Glu 
500 505 510 

He Asn Glu Gin Gly His Glu Val Leu Arg Glu Met Leu His Asn His 
515 520 525 

Ser Phe Val Gly Cys Val Asn Pro Gin Trp Ala Leu Ala Gin His Gin 
530 535 540 

Thr Lys Leu Tyr Leu Leu Asn Thr Thr Lys Leu Ser Glu Glu Leu Phe 
545 550 555 560 

Tyr Gin He Leu He Tyr Asp Phe Ala Asn Phe Gly Val Leu Arg Leu 
565 570 575 

Ser Glu Pro Ala Pro Leu Phe Asp Leu Ala Met Leu Ala Leu Asp Ser 
580 585 590 

Pro Glu Ser Gly Trp Thr Glu Glu Asp Gly Pro Lys Glu Gly Leu Ala 
595 SOD 605 

Glu Tyr He Val Glu Phe Leu Lys Lys Lys Ala Glu Met Leu Ala Asp 
610 615 , 620 

Tyr Phe Ser Leu Glu He Asp Glu Glu Gly Asn Leu He Gly Leu Pro 

625 630 635 640 

Leu Leu He Asp Asn Tyr Val Pro Pro Leu Glu Gly Leu Pro He Phe 
645 650 655 
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Phe Glu Ser Leu Ser Lys Glu Cys Ala Met Phe Tyr Ser He Arg Lys 
675 680 685 


Gin Tyr lie Ser Glu Glu Ser Thr Leu Ser Gly Gin Gin Ser Glu Val 
690 695 700 


Pro Gly Ser lie Pro Asn Ser Trp Lys Trp Thr Val Glu His He Val 
705 710 715 720 


Asp Gly Asn He Leu Gin Leu Ala Asn Leu Pro Asp Leu Tyr Lys Val 
740 745 750 


<210> 13 

<211> 426 

<212> DNA 

<213> Homo sapiens 

<400> 13 

cgaggcggat cgggtgttgc atccatggag cgagctgaga gctcgagtac agaacctgct 60 

aaggccatca aacctattga tcggaagtca gtccatcaga tttgctctgg gcaggtggta 120 

ctgagtctaa gcactgeggt aaaggagtta gtagaaaaca gtctggatgc tggtgccact 180 

aatattgatc taaagcttaa ggactatgga gtggatctta ttgaagtttc agacaatgga 240 

tgtggggtag aagaagaaaa cttcgaaggc ttaactctga aacatcacac atctaagatt 300 

caagagtttg ccgacctaac tcaggttgaa acttttggct ttcgggggga agctctgagc 360 

tcactttgtg cactgagcga tgtcaccatt tctacctgcc acgcatcggc gaaggttgga 420 

acttga 426 

<210> 14 
<211> 133 

<212> .PRT 

<213> Homo sapiens 

<400> 14 

Met Lys Gin Leu Pro Ala Ala Thr Val Arg Leu Leu Ser Ser Ser Gin 


He He Thr Ser Val Val Ser Val Val Lys Glu Leu He Glu Asn Ser 
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20 25 30 


Leu Asp Ala Gly Ala Thr Ser Val Asp Val Lys Leu Glu Asn Tyr Gly 
35 40 45 


Phe Asp Lys He Glu Val Arg Asp Asn Qly Glu Gly He Lys Ala Val 
50 55 60 


Asp Ala Pro Val Met Ala Met Lys Tyr Tyr Thr Ser Lys He Asn Ser 
65 70 75 80 


His Glu Asp Leu Glu Asn Leu Thr Thr Tyr Gly Phe Arg Gly Glu Ala 
85 90 95 


Leu Gly Ser He Cys Cys He Ala Glu Val Leu He Thr Thr Arg Thr 
100 105 110 


Ala Ala Asp Asn Phe Ser Thr Gin Tyr Val Leu Asp Gly Ser Gly His 
115 120 125 


He Leu Ser Gin Lys 
130 


<210> 15 

<211> 4264 

<212> DNA 

<213> Homo sapiens 

<400> 15 

atttcccgcc agcaggagcc gcgcggtaga tgcggtgctt ttaggagctc cgtccgacag SO 

aacggttggg ccttgccggc tgtcggtatg tcgcgacaga gcaccctgta cagcttcttc 120 

cccaagtctc cggcgctgag tgatgccaac aaggcctcgg ccagggcctc acgcgaaggc 180 

ggccgtgccg ccgctgcccc cggggcctct ccttccccag gcggggatgc ggcctggagc 240 

gaggctgggc ctgggcccag gcccttggcg cgatccgcgt caccgcccaa ggcgaagaac 300 

ctcaacggag ggctgcggag atcggtagcg cctgctgccc ccaccagttg tgacttctca 360 

ccaggagatt tggtttgggc caagatggag ggttacccct ggtggccttg tctggtttac 420 

aaccacccct ttgatggaac attcatccgc gagaaaggga aatcagtccg tgttcatgta 480 

cagttttttg atgacagccc aacaaggggc tgggttagca aaaggctttt aaagccatat 540 

acaggttcaa aatcaaagga agcccagaag ggaggtcatt tttacagtgc aaagcctgaa 600 

atactgagag caatgcaacg tgcagatgaa gccttaaata aagacaagat taagaggctt 660 

gaattggcag tttgtgatga gccctcagag ccagaagagg aagaagagat ggaggtaggc 720 

acaacttacg taacagataa gagtgaagaa gataatgaaa ttgagagtga agaggaagta 780 

cagcctaaga cacaaggatc taggcgaagt agccgccaaa taaaaaaacg aagggtcata 840 
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tcagattctg agagtgacat tggtggctct gatgtggaat ttaagccaga cactaaggag 900 

gaaggaagca gtgatgaaat aagcagtgga gtgggggata gtgagagtga aggcctgaac 960 

agccctgtca aagttgctcg aaagcggaag agaatggtga ctggaaatgg ctctcttaaa 1020 

aggaaaagct ctaggaagga aacgccctca gccaccaaac aagcaactag catttcatca 1080 

gaaaccaaga atactttgag agctttctct gcccctcaaa attctgaatc ccaagcccac 1140 

gttagtggag gtggtgatga cagtagtcgc cctactgttt ggtatcatga aactttagaa 1200 

tggcttaagg aggaaaagag aagagatgag cacaggagga ggcctgatca ccccgatttt 1260 

gatgcatcta cactctatgt gcctgaggat ttcctcaatt cttgtactcc tgggatgagg 1320 

aagtggtggc agattaagtc tcagaacttt gatcttgtca tctgttacaa ggtggggaaa 1380 

ttttatgagc tgtaccacat ggatgctctt attggagtca gtgaactggg gctggtattc 1440 

atgaaaggca actgggccca ttctggcttt cctgaaattg catttggccg ttattcagat 1500 

tccctggtgc agaagggcta taaagtagca cgagtggaac agactgagac tccagaaatg 1560 

atggaggcac gatgtagaaa gatggcacat atatccaagt atgatagagt ggtgaggagg 1620 

gagatctgta ggatcattac caagggtaca cagacttaca gtgtgctgga aggtgatccc 1680 

tctgagaact acagtaagta tcttcttagc ctcaaagaaa aagaggaaga ttcttctggc 1740 

catactcgtg catatggtgt gtgctttgtt gatacttcac tgggaaagtt tthcataggt 1800 

cagttttcag atgatcgcca ttgttcgaga tttaggactc tagtggcaca ctatccccca 1860 

gtacaagttt tatttgaaaa aggaaatctc tcaaaggaaa ctaaaacaat tctaaagagt 1920 

tcattgtcct gttctcttca ggaaggtctg atacccggct cccagttttg ggatgcatcc 1980 

aaaactttga gaactctcct tgaggaagaa tattttaggg aaaagctaag tgatggcatt . 2040 

ggggtgatgt taccccaggt gcttaaaggt atgacttcag agtctgattc cattgggttg 2100 

acaccaggag agaaaagtga attggccctc tctgctctag gtggttgtgt cttctacctc 2160 

aaaaaatgcc ttattgatca ggagctttta tcaatggcta attttgaaga atatattccc 2220 

ttggattctg acacagtcag cactacaaga tctggtgcta tcttcaccaa agcctatcaa 2280 

cgaatggtgc tagatgcagt gacattaaac aacttggaga tttttctgaa tggaacaaat 2340 

ggttctactg aaggaaccct actagagagg gttgatactt gccatactcc ttttggtaag 2400 

cggctcctaa agcaatggct ttgtgcccca ctctgtaacc attatgctat taatgatcgt 2460 

ctagatgcca tagaagacct catggttgtg cctgacaaaa tctccgaagt tgtagagctt 2520 

ctaaagaagc ttccagatct tgagaggcta ctcagtaaaa ttcataatgt tgggtctccc 2580 

ctgaagagtc agaaccaccc agacagcagg gctataatgt atgaagaaac tacatacagc 2640 

aagaagaaga ttattgattt tctttctgct ctggaaggat tcaaagtaat gtgtaaaatt 2700 

atagggatca tggaagaagt tgctgatggt tttaagtcta aaatccttaa gcaggtcatc 2760 
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tctctgcaga caaaaaatcc tgaaggtcgt tttcctgatt tgactgtaga attgaaccga 2820 

tgggatacag cctttgacca tgaaaaggct cgaaagactg gacttattac tcccaaagca 2 880 

ggctttgact ctgattatga ccaagctctt gctgacataa gagaaaatga acagagcctc 2940 

ctggaatacc tagagaaaca gcgcaacaga attggctgta ggaccatagt ctattggggg 3000 

attggtagga accgttacca gctggaaatt cctgagaatt tcaccactcg caatttgcca 3060 

gaagaatacg agttgaaatc taccaagaag ggctgtaaac gatactggac caaaactatt 3120 

gaaaagaagt tggctaatct cataaatgct gaagaacgga gggatgtatc attgaaggac 3180 

tgcatgcggc gactgttcta taactttgat aaaaattaca aggactggca gtctgctgta 3240 

gagtgtatcg cagtgttgga tgttttactg tgcctggcta actatagtcg agggggtgat 3300 

ggtcctatgt gtcgcccagt aattctgttg ccggaagata cccccccctt cttagagctt 33 50 

aaaggatcac gccatccttg cattacgaag actttttttg gagatgattt tattcctaat 3420 

gacattctaa taggctgtga ggaagaggag caggaaaatg gcaaagccta ttgtgtgctt 3480 

gttactggac caaatatggg gggcaagtct acgcttatga gacaggctgg cttattagct 3 540 

gtaatggccc agatgggttg ttacgtccct gctgaagtgt gcaggctcac accaattgat 3600 

agagtgttta ctagacttgg tgcctcagac agaataatgt caggtgaaag tacatttttt 3660 

gttgaattaa gtgaaactgc cagcatactc atgcatgcaa cagcacattc tctggtgctt 3720 

gtggatgaat taggaagagg tactgcaaca tttgatggga cggcaatagc aaatgcagtt 3780 

gttaaagaac ttgctgagac tataaaatgt cgtacattat tttcaactca ctaccattca 3840 

ttagtagaag attattctca aaatgttgct gtgcgcctag gacatatggc atgcatggta 3900 

gaaaatgaat gtgaagaccc cagccaggag actattacgt tcctctataa attcattaag 3960 

ggagcttgtc ctaaaagcta tggctttaat gcagcaaggc ttgctaatct cccagaggaa 4020 

gttattcaaa agggacatag aaaagcaaga gaatttgaga agatgaatca gtcactacga 4080 

ttatttcggg aagtttgcct ggctagtgaa aggtcaactg tagatgctga agctgtccat 4140 

aaattgctga ctttgattaa ggaattatag actgactaca ttggaagctt tgagttgact 4200 

tctgaccaaa ggtggtaaat tcagacaaca ttatgatcta ataaacttta ttttttaaaa 4260 

atga 4264 

<210> 16 

<211> 13S0 

<212> PRT 

<213> Homo sapiens 

<400> 16 

Met Ser Arg Gin Ser Thr Leu Tyr Ser Phe Phe Pro Lys Ser Pro Ala 
15 10 15 
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Leu Ser Asp Ala Asn Lys Ala Ser Ala Arg Ala Ser Arg Glu Gly Gly 
20 25 30 

Arg Ala Ala Ala Ala Pro Gly Ala Ser Pro Ser Pro Gly Gly Asp Ala 
35 40 45 

Ala Trp ser Glu Ala Gly Pro Gly Pro Arg Pro Leu Ala Arg Ser Ala 
50 55 60 

Ser Pro Pro Lys Ala Lys Asn Leu Asn Gly Gly Leu Arg Arq Ser Val 
S5 70 75 ao 

Ala Pro Ala Ala Pro Thr Ser Cys Asp Phe Ser Pro Gly Asp Leu Val 
85 90 95 

Trp Ala Lys Met Glu Gly Tyr Pro Trp Trp Pro Cys Leu Val Tyr Asn 
100 105 13^0 

His Pro Phe Asp Gly Thr Phe lie Arg Glu Lys Gly Lys Ser Val Arg 
115 120 125 

Val His Val Gin Phe Phe Asp Asp Ser Pro Thr Arg Gly Trp Val Ser 
130 135 -^40 

Lys Arg Leu Leu Lys Pro Tyr Thr Gly Ser Lys Ser Lys Glu Ala Gin 
l-^S 150 155 160 

Lys Gly Gly His Phe Tyr Ser Ala Lys Pro Glu lie Leu Arg Ala Met 
1S5 170 

Gin Arg Ala Asp Glu Ala Leu Asn Lys Asp Lys He Lys Arg Leu Glu 
180 185 190 

Leu Ala Val Cys Asp Glu Pro Ser Glu Pro Glu Glu Glu Glu Glu Met 
195 200 205 

Glu Val Gly Thr Thr Tyr Val Thr Asp Lys Ser Glu Glu Asp Asn Glu 
210 215 220 

He Glu Ser Glu Glu Glu Val Gin Pro Lys Thr Gin Gly Ser Arc Ara 

225 . 230 ■ 235 240 

Ser Ser Arg Gin He Lys Lys Arg Arg, Val He Ser Asp Ser Glu Ser 
245 250 255 

Asp He Gly Gly Ser Asp Val Glu Phe Lys Pro Asp Thr Lys Glu Glu 
250 265 270 
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Gly Ser Ser Asp Glu lie Ser Ser Gly Val Gly Asp Ser Glu Ser Glu 
275 280 285 


Gly Leu Asn Ser Pro Val Lys Val Ala Arg Lys Arg Lys Arg Met Val 
290 295 300 


Thr Gly Asn Gly Ser Leu Lya Arg Lys Ser Ser Arg Lys Glu Thr Pro 
305 310 315 320 


Leu Arg Ala Phe Ser Ala Pro Gin Asn Ser Glu Ser Gin Ala His Val 

340 345 350 


Thr Leu Glu Trp Leu Lys Glu Glu Lys Arg Arg Asp Glu His Arg Arg 
370 375 380 


Arg Pro Asp His Pro Asp Phe Asp Ala Ser Thr Leu Tyr Val Pro Glu 
385 390 395 400 


Asp Phe Leu Asn Ser Cys Thr Pro Gly Met Arg Lys Trp Trp Gin lie 
405 410 415 


Lys Ser Gin Asn Phe Asp Leu Val lie Cys Tyx Lys Val Gly Lys Phe 
420 425 430 


Tyr Glu Leu Tyr His Met Asp Ala Leu lie Gly Val Ser Glu Leu Gly 
435 440 445 


Leu Val Phe Met Lys Gly Asn Trp Ala His Ser Gly Phe Pro Glu lie 
450 455 460 


Ala Phe Gly Arg Tyr Ser Asp Ser Leu Val Gin Lys Gly Tyr Lys Val 
465 470 475 480 


Ala Arg Val Glu Gin Thr Glu Thr Pro Glu Met Met Glu Ala Arg Cys 
485 490 495 


Arg Lys Met Ala His lie Ser Lys Tyr Asp Arg Val Val Arg Arg Glu 

500 505 510 


lie Cys Arg lie lie Thr Lys Gly Thr Gin Thr Tyr Ser Val Leu Glu 
515 520 525 
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61y Asp Pro Ser Glu Asn Tyr Ser Lys Tyr Leu Leu Ser Leu Lys Glu 
530 535 540 


Val Aep Thr Ser Leu Gly Lys Phe Phe lie Gly Gin Phe Ser Asp Asp 
565 570 575 


Gin Val Leu Phe Glu Lys Gly Asn Leu Ser Lys Glu Thr Lys Thr lie 
595 600 SOB 


Leu Ijys Ser Ser Leu Ser Cys Ser Leu Gin Glu Gly Leu lie Pro Gly 
610 615 620 


Gin Val Leu Lys Gly Met Thr Ser Glu Ser Asp Ser lie Gly Leu Thr 
. 660 665 670 


Pro Gly Glu Lys Ser Glu Leu Ala Leu Ser Ala Leu Gly Gly Cys Val 
675 680 685 


Phe Tyr Leu Lys Lys Cys Leu lie Asp Gin Glu Leu Leu Ser Met Ala 
690, 695 700 


Asn Phe Glu Glu Tyr lie Pro Leu Asp Ser Asp Thr Val Ser Thr Thr 
705 710 715 720 


Arg Ser Gly Ala lie Phe Thr Lys Ala Tyr Gin Arg Met Val Leu Asp 
725 730 735 


Ala Val Thr Leu Asn Asn Leu Glu He Phe Leu Asn Gly Thr Asn Gly 

740 745 750 


Ser Thr Glu Gly Thr Leu Leu Glu Arg Val Asp Thr Cys His Thr Pro 
755 760 765 
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His Tyr Ala He Asn Asp Arg Leu Asp Ala He Glu Asp Leu Met Val 
785 790 795 800 

Val Pro Asp Lys He Ser Glu Val Val Glu Leu Leu Lys Lys Leu Pro 
805 810 815 

Asp Leu Glu Arg Leu Leu Ser Lys He His Asn Val Gly Ser Pro Leu 
820 825 830 

Lys Ser Sin Asn His Pro Asp Ser Arg Ala He Met Tyr Glu Glu Tlir 
835 840 845 

Thr Tyr Ser Lys Lys Lys He He Asp Phe Leu Ser Ala Leu Glu Gly 
850 855 860 

Phe Lys Val Met Cys Lys He He Gly He Met Glu Glu Val Ala Asp 
865 870 875 880 

Gly Phe Lys Ser Lys He Leu Lys Gin Val He Ser Leu Gin Thr Lys 
885 890 895 

Asn Pro Glu Gly Arg Phe Pro Asp Leu Thr Val Glu Leu Asn Arg Trp 
900 905 910 

Asp Thr Ala Phe Asp His Glu Lys Ala Arg Lys Thr Gly Leu He Thr 
915 920 925 

Pro Lys Ala Gly Phe Asp Ser Asp Tyr Asp Gin Ala Leu Ala Asp He 
930 935 940 

Arg Glu Asn Glu Gin Ser Leu Leu Glu Tyr Leu Glu Lys Gin Arg Asn 
945 950 955 960 

Arg He Gly Cys Arg Thr He Val Tyr Trp Gly He Gly Arg Asn Arg 
965 970 975 

Tyr Gin Leu Glu He Pro Glu Asn Phe Thr Thr Arg Asn Leu Pro Glu 
980 985 990 

Glu Tyr Glu Leu Lys Ser Thr Lys Lys Gly Cys Lys Arg Tyr Trp Thr 
995 1000 1005 

Lys Thr He Glu Lys Lys Leu Ala Asn Leu He Asn Ala Glu Glu 
1010 1015 1020 

Arg Arg Asp Val Ser Leu Lys Asp Cys Met Arg Arg Leu Phe Tyr 
1025 1030 1035 
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Asn Phe Asp Lys Asn Tyr Lys Asp Trp Gin Ser Ala Val Glu Cys 
104O 1045 1050 

lie Ala Val Leu Asp Val Leu Leu Cys Leu Ala Asn Tyr Ser Arg 
. 1055 1060 1065 

Gly C31y Asp Gly Pro Met Cys Arg Pro Val lie Leu Leu Pro Glu 
X070 1075 1080 


Asp Thr Pro Pro Phe Leu Glu Leu Lys Gly Ser Arg His Pro Cys 
1085 1090 1095 

He Thr Lys Thr Phe Phe Gly Asp Asp Phe He Pro Asn Asp He 
1100 1105 1110 


Leu He Gly Cys Glu Glu Glu Glu Gin Glu Asn Gly Lys Ala Tyr 
1115 1120 1125 

Cys Val Leu Val Thr Gly Pro Asn Met Gly Gly Lys Ser Thr Leu 
1130 1135 1140 


Met Arg Gin Ala Gly Leu Leu Ala Val Met Ala .Gin Met Gly Cys 
. 1145 1150 1155 


Tyr Val Pro Ala Glu Val Cys Arg Leu Thr Pro He Asp Arg Val 
1160 1165 1170 

Phe Tlir .;Arg Leu Gly Ala Ser Asp Arg He Met Ser Gly Glu Ser 
1175 " 1180 1185 

Thr Phe Phe Val Glu Leu Ser Glu Thr Ala Ser He Leu Met His 
1190 1195 1200 


Ala Thr Ala His Ser Leu Val Leu Val Asp Glu Leu Gly Arg Gly 
1205 1210 1215 

Thr Ala Thr Phe Asp Gly Thr Ala He Ala Asn Ala Val Val Lys 
1220 1225 1230 

Glu Leu Ala Glu Thr He Lys Cys Arg Thr Leu Phe Ser Thr His 
1235 1240 1245 


Tyr His Ser Leu Val Glu Asp Tyr Ser Gin Asn Val Ala Val Arg 
1250 1255 1260 


Leu Gly His Met Ala Cys Met Val Glu Asn Glu Cys Glu Asp Pro 
1255 1270 1275 
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Ser Gin Glu Thr lie Thr Phe Leu Tyr Lys Phe lie Lys Gly Ala 
1280 1285 1290 


Cys Pro Lys Ser Tyr Gly Phe Asn Ala Ala Arg Leu Ala Asn Leu 
1295 1300 1305 


Pro Glu Glu Val lie Gin Lys Gly His Arg Lys Ala Arg Glu Phe 
1310 .1315 1320 


Glu Lys Met Asn Gin Ser Leu Arg Leu Phe Arg Glu Val Cys Leu 
1325 1330 1335 


Ala Ser Glu Arg Ser Thr Val Asp Ala Glu Ala Val His Lys Leu 
1340 1345 1350 


Leu Thr Leu lie Lys Glu Leu 
1355 1360 

<210> 17 

<211> 1408 

<212> DNA 

<213> Homo sapiens 

<400> 17 

ggcgctccta cctgcaagtg gctagtgcca agtgctgggc cgccgctcct gccgtgcatg 60 

ttggggagcc agtacatgca ggtgggctcc acacggagag gggcgcagac ccggtgacag 120 

ggctttacct ggtacatcgg catggcgcaa ccaaagcaag agagggtggc gcgtgccaga 180 

caccaacggt cggaaaccgc cagacaccaa cggtcggaaa ccgccaagac accaacgctc 240 

ggaaaccgcc agacaccaac gctcggaaac cgccagacac caaggctcgg aa.tccacgcc 300 

aggccacgac ggagggcgac tacctccctt ctgaccctgc tgctggcgtt cggaaaaaac 360 

gcagtccggt gtgctctgat tggtccaggc tctttgacgt cacggactcg acctttgaca 420 

gagccactag gcgaaaagga gagacgggaa gtattttttc cgccccgccc . ggaaagggtg 480 

gagcacaacg tcgaaagcag ccgttgggag cccaggaggc ggggcgcctg tgggagccgt 540 

ggagggaact ttcccagtcc ccgaggcgga tccggtgttg catccttgga gcgagctgag 600 

aactcgagta cagaacctgc taaggccatc aaacctattg atcggaagtc agtccatcag 660 

atttgctctg ggccggtggt accgagtcta aggccgaatg cggtgaagga gttagtagaa 720 

aacagtctgg atgctggtgc cactaatgtt gatctaaagc ttaaggacta tggagtggat 780 

ctcattgaag tttcaggcaa tggatgtggg gtagaagaag aaaacttcga aggctthact 840 

ctgaaacatc acacatgtaa gattcaagag tttgccgacc taactcaggt ggaaactttt 900 

ggctttcggg gggaagctct gagctcacfct tgtgcactga gtgatgtcac catttctacc 960 

tgccgtgtat cagcgaaggt tgggactcga ctggtgtttg atcactatgg gaaaatcatc 1020 
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cagaaaaccc 

cctacccccg 

ccccagaggg atgacagtca gcgtgaagca gttattttct 

1080 

acgctacctg 

tgcaccataa 

agaatttcaa aggaatatta agaagaaacg tgcctgcttc 

1140 

cccttcgcct 

tctgccgtga 

ttgtcagttt cctgaggcct ccccagccat gcttcctgta 

1200 

cagcctgtag 

aactgactcc 

tagaagtacc ccaccccacc cctgctcctt ggaggacaac 

1250 

gtgatcactg 

tattcagctc 

tgtcaagaat ggtccaggtt cttctagatg atctgcacaa 

1320 

atggttcctc 

tcctccttcc 

tgatgtchgc cattagcatt ggaataaagt tcctgctgaa 

1380 

aatccaaaaa 

aaaaaaaaaa 

aaaaaaaa 

1408 


<210> 18 

<211> 389 

<212> PRT 

<213> Homo sapiens 

<400> 18 

Met Ala Gin Pro Lys Gin Glu Arg Val Ala Arg Ala Arg His Gin Arg 
15 10 15 


Ser Glu Thr Ala Arg His Gin Arg Ser Glu Thr Ala Lys Thr Pro Thr 

20 25 30 

Leu Gly Asn Arg Gin Thr Pro Thr Leu Gly Asn Arg Gin Thr Pro Arg 

35 40 45 

Leu Gly He His Ala Arg Pro Arg Arg Arg Ala Thr Thr Ser Leu Leu 

50 55 60 

Thr Leu Leu Leu Ala Phe Gly Lys Asn Ala Val Arg Cys Ala Leu He 


Gly Pro Gly Ser Leu Thr Ser Arg Thr Arg Pro Leu Thr Glu Pro Leu 
85 90 95 


Gly Glu Lys Glu Arg Arg Glu Val Phe Phe Pro Pro Arg Pro Glu Arg 
100 105 110 

Val Glu His Asn Val Glu Ser Ser Arg Trp Glu Pro Arg Arg Arg Gly 
115 120 125 

Ala Cys Gly Ser Arg Gly Gly Asn Phe Pro Ser Pro Arg Gly Gly Ser 
130 135 140 

Gly Val Ala Ser Leu Glu Arg Ala Glu Asn Ser Ser Thr Glu Pro Ala 
145 150 155 160 

Lys Ala He Lys Pro He Asp Arg Lys Ser Val His Gin He Cys Ser 
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Glu Asn Ser Leu Asp Ala Gly Ala Thr Asn Val Asp Leu Lys Leu Lys 
195 200 205 


Asp Tyr Gly Val Asp Leu lie Glu Val Ser Gly Asn Gly Cys Gly Val 
210 215 220 


Glu Glu Glu Asn Phe Glu Gly Phe Thr Leu Lys His His Thr Cys Lys 
225 230 235 240 


lie Gin Glu Phe Ala Asp Leu Thr Gin Val Glu Thr Phe Gly Phe Arg 
245 250 255 


Gly Glu Ala Leu Ser Ser Leu Cys Ala Leu Ser Asp Val Thr lie Ser 
260 265 270 


Tyr Gly Lys He He Gin Lys Thr Pro Tyr Pro Arg Pro Arg Gly Met 
290 295 300 


Glu Phe Gin Arg Asn He Lys Lys Lys Arg Ala Cys Phe Pro Phe Ala 
325 330 335 


Phe Cys Arg Asp Cys Gin Phe Pro Glu Ala Ser Pro Ala Met Leu Pro 
340 345 350 


Val Gin Pro Val Glu Leu Thr Pro Arg Ser Thr Pro Pro His Pro Cys 
355 360 365 


Pro Gly Ser Ser Arg 
385 


<212> DWA 

<213> Homo sapiens 
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<400>- 19 

tttttagaaa ctgatgttta ttttccatca accatttttc catgctgctt aagagaatat 60 

gcaagaacag cttaagacca gtcagtggtt gctcctaccc attcagtggc ctgagcagtg 120 

gggagctgca gaccagtctt ccgtggcagg ctgagcgctc cagtcttcag tagggaattg 180 

ctgaataggc acagagggca cctgtacacc ttcagaccag tctgcaacct caggctgagt 240 

agcagtgaac tcaggagcgg gagcagtcca ttcaccctga aattcctcct tggtcactgc 300 

cttctcagca gcagcctgct cttctttttc aatctcttca ggatctctgt agaagtacag 360 

atcag-gcatg acctcccatg ggtgttcacg ggaaatggtg ccacgcatgc gcagaacttc 420 

ccgagccagc atccaccaca ttaaacccac tgagtgagct cccttgttgt tgcatgggat 480 

ggcaatgtcc acatagcgca gaggagaatc tgtgttacac agcgcaatgg taggtaggtt 540 

aacataagat gcctccgtga gaggcgaagg ggcggcggga cccgggcctg gcccgtatgt 600 

gtccttggcg gcctagacta ggccgtcgct gtatggtgag ccccagggag gcggatctgg 660 

gcccccagaa ggacacccgc ctggatttgc cccgtagccc ggcccgggcc cctcgggagc 720 

agaacagcct tggtgaggtg gacaggaggg gacctcgcga gcagacgcgc gcgccagcga 780 

cagcagcccc gccccggcct ctcgggagcc ggggggcaga ggctgcggag ccccaggagg 840 

gtctatcagc cacagtctct gcatgtttcc aagagcaaca ggaaatgaac acattgcagg 900 

ggccagtgtc attcaaagat gtggctgtgg atttcaccca ggaggagtgg cggcaactgg 960 

accctgatga gaagatagca tacggggatg tgatgttgga gaactacagc catctagttt 1020 

ctgtggggt-a tgattatcac caagccaaac atcatcatgg agtggaggtg aaggaagtgg 1080 

agcagggaga ggagccgtgg ataatggaag gtgaatttcc atgtcaacat agtccagaac 1140 

ctgctaaggc catcaaacct attgatcgga agtcagtcca tcagatttgc tctgggccag 1200 

tggtactgag tctaagcact gcagtgaagg agttagtaga aaacagtctg gatgctggtg 1260 

ccactaatat tgatctaaag cttaaggact atggagtgga tctcattgaa gtttcagaca 1320 

atggatgtgg ggtagaagaa gaaaactttg aaggcttaat ctctttcagc tctgaaacat 1380 

cacacatgta agattcaaga gtttgccgac ctaactgaag ttgaaacttt cggttttcag 1440 

ggggaagctc tgagctcact gtgtgcactg agcgatgtca ccatttctac ctgccacgcg 1500 

ttggtgaagg ttgggactcg actggtgttt gatcacgatg ggaaaatcat ccaggaaacc 1560 

ccctaccccc accccagagg gaccacagtc agcgtgaagc agttattttc tacgctacct 1620 

gtgcgccata aggaatttca aaggaatatt aagaagacgt gcctgcttcc ccttcgcctt 1680 

ctgccgtgat tgtcagtttc ctgaggcctc cccagccatg cttcctgtac agcctgcaga 1740 

actgtgagtc aattaaacct cttttcttca taaattaaaa aaaaa 1785 


<210> 20 
<211> 264 
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<212> PRT 

<213> Homo sapiens 

<400> 20 

Met Cys Pro Trp Arg Pro Arg Leu Gly Arg Arg Cys Met Val Ser Pro 
1 5 10 15 

Arg Glu Ala Asp Leu Gly Pro Gin Lys Asp Thr Arg Leu Asp Leu Pro 


Arg Ser Pro Ala Arg Ala Pro Arg Glu Gin Asn Ser Leu Gly Glu Val 
35 40 45 


Asp Arg Arg Gly Pro Arg Glu Gin Thr Arg Ala Pro Ala Thr Ala Ala 


Pro Pro Arg Pro Leu Gly Ser Arg Gly Ala Glu Ala Ala Glu Pro Gin 
65 70 75 80 


Glu Gly Leu Ser Ala Thr Val Ser Ala Cys Phe Gin Glu Gin Gin Glu 
85 90 95 


Met Asn Thr Leu Gin Gly Pro Val Ser Phe Lys Asp Val Ala Val Asp 
100 105 110 


Phe Thr Gin Glu Glu Trp Arg Gin Leu Asp Pro Asp Glu Lys lie Ala 
115 120 125 


Tyr Gly Asp Val Met Leu Glu Asn Tyr Ser His Leu Val Ser Val Gly 
130 135 140 


Tyr Asp Tyr His Gin Ala Lys His His His Gly Val Glu Val Lys Glu 
145 150 155 160 


Val Glu Gin Gly Glu Glu Pro Trp lie Met Glu Gly Glu Phe Pro Cys 
165 170 175 


Gin His Ser Pro Glu Pro Ala Lys Ala lie Lys Pro lie Asp Arg Lys 
180 185 190 


Ser Val His Gin He Cys Ser Gly Pro Val Val Leu Ser Leu Ser Thr 
195 200 205 


Ala Val Lys Glu Leu Val Glu Asn Ser Leu Asp Ala Gly Ala Thr Asn 
210 215 220 


He Asp Leu Lys Leu Lys Asp Tyr Gly Val Asp Leu He Glu Val Ser 
225 230 235 240 
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Asp hsn Gly Cys Gly Val Glu Glu Glu Asn Phe Glu Gly Leu lie Ser 
245 250 255 


Phe Ser Ser Glu Thr Ser His Met 
260 


<210> 21 

<2H> 795 

<212> DNA 

<213> Homo sapiens 

<400> 21 


atgtgtcctt 

ggcggcctag 

actaggccgt 

cgctgtatgg tgagccccag 

ggaggcggat 

60 

ctgggccccc 

agaaggacac 

ccgcctggat 

ttgccccgta gcccggcccg 

ggcccctcgg 

120 

gagcagaaca 

gccttggtga 

ggtggacagg 

sg9993-cctc gcgagcagac 

gcgcgcgcca 

180 

gcgacagcag 

ccccgccccg 

gcctctcggg 

agccgggggg cagaggctgc 

ggagccccag 

240 

gagggtctat 

cagccacagt 

ctctgcatgt 

ttccaagagc aacaggaaat 

gaacacattg 

300 

caggggccag 

tgtcattcaa 

agatgtggct 

gtggatttca cccaggagga 

gtggcggcaa 

360 

ctggaccctg 

atgagaagat 

agcatacggg 

gatgtgatgt tggagaacta 

cagccatcta 

420 

gtttctgtgg 

ggtatgatta 

tcaccaagcc 

aaacatcatc atggagtgga 

ggtgaaggaa 

480 

gtggagcagg 

gagaggagcc 

gtggataatg 

gaaggtgaat ttccatgtca 

acatagtcca 

540 

gaacctgcta 

aggccatcaa 

acctattgat 

cggaagtcag tccatcagat 

ttgctctggg 

600 

ccagtggtac 

tgagtctaag 

cactgcagtg 

aaggagttag tagaaaacag 

tctggatgct 

660 

ggtgccacta 

atattgatct 

aaagcttaag 

gactatggag tggatctcat 

tgaagtttca 

720 

gacaatggat 

gtggggtaga 

agaagaaaac 

tttgaaggct taatctcttt 

cagctctgaa 

780 

acatcacaca 

tgtaa 




795 


<210> 22 

<211> 264 

<212> PRT 

<213> Homo sapiens 

<400> 22 

Met Cys Pro Trp Arg Pro Arg Leu Gly Arg Arg Cys Met Val Ser Pro 
15 10 15 

Arg Glu Ala Asp Leu Gly Pro Gin Lys Asp Thr Arg Leu Asp Leu Pro 
20 25 30 

Arg Ser Pro Ala Arg Ala Pro Arg Glu Gin Asn Ser Leu Gly Glu Val 
35 40 ■ 45 

Asp Arg Arg Gly Pro Arg Glu Gin Thr Arg Ala Pro Ala Thr Ala Ala 
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Pro Pro Arg Pro Leu Gly Ser Arg Gly Ala Glu Ala Ala Glu Pro Gin 


Glu Gly Leu Ser Ala Thr Val Ser Ala Cys Phe Gin Glu Gin Gin Glu 


Met Asn Thr Leu Gin Gly Pro Val Ser Phe Lys Asp Val Ala Val Asp 
100 105 110 


Phe Thr Gin Glu Glu Trp Arg Gin Leu Asp Pro Asp Glu Lys lie Ala 
115 120 125 


Tyr Gly Asp Val Met Leu Glu Asn Tyr Ser His Leu Val Ser Val Gly 
130 135 140 

Tyr Asp Tyr His Gin Ala Lys His His His Gly Val Glu Val Lys Glu 
145 150 155 160 


Val Glu Gin Gly Glu Glu Pro Trp lie Met Glu Gly Glu Phe Pro Cys 
165 170 175 


Gin His Ser Pro Glu Pro Ala Lys Ala lie Lys Pro lie Asp Arg Lys 
180 185 190 


Ser Val His Gin lie Cys Ser Gly Pro Val Val Leu Ser Leu Ser Thr 
195 200 205 


Ala Val Lys Glu Leu Val Glu Asn Ser Leu Asp Ala Gly Ala Thr Asn 

210 215 220 


lie Asp Leu Lys Leu Lys Asp Tyr Gly Val Asp Leu He Glu Val Ser 
225 230 235 240 


Asp Asn Gly Cys Gly Val . Glu Glu Glu Asn Phe Glu Gly Leu He Ser 
245 250 255 


Phe Ser Ser Glu Thr Ser His Met 
260 


<210> 23 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide primer 
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acgcatatgg agcgagctga gagctcgagt 


30 


<210> 24 
<211> 75 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide primer 

<400> 24 

gaattcttat cacgtagaat cgagaccgag gagagggtta gggataggct taccagttcc 60 
aaccttcgcc gatgc 75 


<210> 25 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide primer 


<210> 26 
<211> 75 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide primer 
<400> 26 

gaattcttat tacgtagaat cgagaccgag gagagggtta gggataggct tacccatgtg 60 
tgatgtttca gagct 75 


<210> 27 

<211> 49 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide primer 
<220> 

<221> misc_feature 

<222> (35).. (35) 

<223> corresponds to the first nucleotide of the first codon of the tar 
get mismatch repair gene 


<220> 

<221> misc_feature 
<222> (36) . . (36) 

<223> corresponds to the second nucleotide of the first codon of the ta 
rget mismatch repair gene 


<400>- 25 

acgcatatgt gtccttggcg gcctaga 


27 


Page 


48 
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<220> 

<221> misc_feature 
<222> (37) . . (37) 

<223> corresponds to the third nucleotide of the first codon of the tar 
get mismatch repair gene 

<220> 

<221> misa_feature 
<222> (38) . . (38) 

<223> corresponds to the first nucleotide of the second codon of the ta 
rget mismatch repair gene 

<220> 

<221> misc_feature 
<222> (39) . . (39) 

<223> corresponds to the second nucleotide of the second codon of the t 
arget mismatch repair gene 

<220> 

<221> misc_feature 
<222> (40) . . (40) 

<223> corresponds to the third nucleotide of the second codon of the ta 
rget mismatch repair gene 

<220> 

<221> tnisc_f eature 
<222> (41) . . (41) 

<223> corresponds to the first nucleotide of the third codon of the tar 
get mismatch repair gene 

<220> 

<221> misc_feature 
<222> (42).. (42) 

<223> corresponds to the second nucleotide of the third codon of the ta 
rget mismatch repair gene 

<220> 

<221>. misc_feature 
<222> (43) . . (43) 

<223> corresponds to the third nucleotide of the third codon of the tar 
get mismatch repair gene 

<220> 

<221> misc_feature 
<222> (44).. (44) 

<223> corresponds to the first nucleotide of the fourth codon of the ta 
rget mismatch repair gene 

<220> 

<221> misc_feature 
<222> (45) . . (45) 

<223> corresponds to the second nucleotide of the fourth codon of the t 
arget mismatch repair gene 
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<220> 

<221> misc_f eature 
<222> (46) . . (46) 

<223> corresponds to the third nucleotide of the fourth codon of the ta 
rget mismatch repair gene 

<220> 

<221> misc_f eature 
<222> (47) . . (47) 

<223> corresponds to the first nucleotide of the fifth codon of the tar 
get misinatch repair gene 

<220> 

<221> misc_f eature 
<222> (48) . . (48) 

<223> corresponds to the second nucleotide of the fifth codon of the ta 
rget mismatch repair gene 

<220> 

<221> misc_feature 
<222> (49) . . (49) 

<223> corresponds to the third nucleotide of the fifth codon of the tar 
get mismatch repair gene 

<400> 27 

tttaatacga ctcactatag ggagaccacc atggnnnnnn nnnnnnnnn 49 

<210> 28 

<211> 4290 

<212> DNA 

<213> Homo sapiens 

<400> 28 

atgatcaagt gcttgtcagt tgaagtacaa gccaaattgc gttctggttt ggccataagc 60 

tccttgggcc aatgtgttga ggaacttgcc ctcaacagta ttgatgctga agcaaaatgt 120 

gtggctgtca gggtgaatat ggaaaccttc caagttcaag tgatagacaa tggatttggg 180 

atggggagtg atgatgtaga gaaagtggga aatcgttatt tcaccagtaa atgccactcg 240 

gtacaggact tggagaatcc aaggttttat ggtttccgag gagaggcctt ggcaaatatt 300 

gctgacatgg ccagtgctgt ggaaatttcg tccaagaaaa acaggacaat gaaaactttt 360 

gtgaaactgt ttcagagtgg aaaagccctg aaagcttgtg aagctgatgt gactagagca 420 

agcgctggga ctactgtaac agtgtataac ctattttacc agcttcctgt aaggaggaaa 48 0 

tgcatggacc ctagactgga gtttgagaag gttaggcaga gaatagaagc tctctcactc 540 

atgcaccctt ccatttcttt ctctttgaga aatgatgttt ctggttccat ggttctbcag 600 

ctccctaaaa ccaaagacgt atgttcccga ttttgtcaaa tttatggatt gggaaagtcc 660 

caaaagctaa gagaaataag ttttaaatat aaagagtttg agcttagtgg ctatatcagc 720 
tctgaagcac attacaacaa gaatatgcag tttttgtttg tgaacaaaag actagtttta 


780 
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aggacaaagc tacataaact cattgacttt ttattaagga aagaaagtat tatatgcaag 840 

ccaaagaatg gtcccaccag taggcaaatg aattcaagtc ttcggcaccg gtctacccca 900 

gaactctatg gcatatatgt aattaatgtg cagtgccaat tctgtgagta tgatgtgtgc 9&0 

atggagccag ccaaaactct gattgaattt cagaactggg acactctctt gttttgcatt 1020 

caggaaggag tgaaaatgtt tttaaagcaa gaaaaattat ttgtggaatt atcaggtgag 1080 

gatattaagg aatttagtga agataatggt tttagtttat ttgatgctac tcttcagaag 1140 

cgtgtgactt ccgatgagag gagcaatttc caggaagcat gtaataatat tttagattcc 12 00 

tatgagatgt ttaatttgca gtcaaaagct gtgaaaagaa aaactactgc agaaaacgta 1260 

aacacacaga gttctaggga ttcagaagct accagaaaaa atacaaatga tgcatttttg 1320 

tacatttatg aatcaggtgg tccaggccat agcaaaatga cagagccatc tttacaaaac 13 80 

aaagacagct cttgctcaga atcaaagatg ttagaacaag agacaattgt agcatcagaa 1440 

gctggagaaa atgagaaaca taaaaaatct ttcctggaac atagctcttt agaaaatccg 1500 

tgtggaacca gtttagaaat gtttttaagc ccttttcaga caccatgtca ctttgaggag 1560 

agtgggcagg atctagaaat atggaaagaa agtactactg ttaatggcat ggctgccaac 1620 

atcttgaaaa ataatagaat tcagaatcaa ccaaagagat ttaaagatgc tactgaagtg 1680 

ggatgccagc ctctgccttt tgcaacaaca ttatggggag tacatagtgc tcagacagag 1740 

aaagagaaaa aaaaagaatc tagcaattgt ggaagaagaa atgtttttag ttatgggcga 1800 

gttaaattat gttccactgg ctttataact catgtagtac aaaatgaaaa aactaaatca 1860 

actgaaacag aacattcatt taaaaattat gttagacctg gtcccacacg tgcccaagaa 1920 

acatttggaa atagaacacg tcattcagtt gaaactccag acatcaaaga tttagccagc 1980 

actttaagta aagaatctgg tcaattgccc aacaaaaaaa attgcagaac gaatataagt 2040 

tatgggctag agaatgaacc tacagcaact tatacaatgt tttctgcttt tcaggaaggt 2100 

agcaaaaaat cacaaacaga ttgcatatta tctgatacat ccccctcttt cccctggtat 2160 

agacacgttt ccaatgatag taggaaaaca gataaattaa ttggtttctc caaaccaatc 2220 

gtccgtaaga agctaagctt gagttcacag ctaggatctt tagagaagtt taagaggcaa 2280 

tatgggaagg ttgaaaatcc tctggataca gaagtagagg aaagtaatgg agtcactacc 2340 

aatctcagtc ttcaagttga acctgacatt ctgctgaagg acaagaaccg cttagagaac 2400 

tctgatgttt gtaaaatcac tactatggag catagtgatt cagatagtag ttgtcaacca 24 60 

gcaagccaca tccttgactc agagaagttt ccattctcca aggatgaaga ttgtttagaa 2520 

caacagatgc ttagtttgag agaaagtcct atgaccctga aggagttatc tctctttaat 2580 

agaaaacctt tggaccttga gaagtcatct gaatcactag cctctaaatt atccagactg 2640 

aagggttccg aaagagaaac tcaaacaatg gggatgatga gtcgttttaa tgaacttcca 2700 
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aattcagatt ccagtaggaa agacagcaag ttgtgcagtg tgttaacaca agatttttgt 2760 

atgttattta acaacaagca tgaaaaaaca gagaatggtg tcatcccaac atcagattct 2820 

gccacacagg ataattcctt taataaaaat agtaaaacac attctaacag caatacaaca 2880 

gagaactgtg tgatatcaga aactcctttg gtattgccct ataataattc taaagttacc 2940 

ggtaaagatt cagatgttct tatcagagcc tcagaacaac agataggaag tcttgactct 3000 

cccagtggaa tgttaatgaa tccggtagaa gatgccacag gtgaccaaaa tggaatttgt 30S0 

tttcagagtg aggaatctaa agcaagagct tgttatgaaa ctgaagagtc aaacacgtgt 3120 

tgttcagatt ggcagcggca tttcgatgta gccctgggaa gaatggttta tgtcaacaaa 3180 

atgactggac tcagcacatt cattgcccca actgaggaca ttcaggctgc ttgtactaaa 3240 

gacctgacaa ctgtggctgt ggatgttgta cttgagaatg ggtctcagta caggtgtcaa 3300 

ccttttagaa gcgaccttgt tcttcctttc cttccgagag ctcgagcaga gaggactgtg 3360 

atgagacagg ataacagaga tactgtggat gatactgtta gtagcgaatc gcttcagtct 3420 

ttgttctcag aatgggacaa tccagtattt gcccgttatc cagaggttgc tgttgatgta 3480 

agcagtggcc aggctgagag cttagcagtt aaaattcaca acatcttgta tccctatcgt 3540 

ttcaccaaag gaatgattca ttcaatgcag gttctccagc aagtagataa caagtttatt 3600 

gcctgtttga tgagcactaa gactgaagag aatggcgagg cagattccta cgagaagcaa 3 660 

caggcacaag gctctggtcg gaaaaaatta ctgtcfctcta ctctaattcc tccgctagag 3720 

ataacagtga cagaggaaca aaggagactc ttatggtgtt accacaaaaa tctggaagat 3780 

ctgggccttg aatttgtatt tccagacact agtgattctc tggtccttgt gggaaaagta 3840 

ccactatgtt ttgtggaaag agaagccaat gaacttcgga gaggaagatc tactgtgacc 3900 

aagagtattg tggaggaatt tatccgagaa caactggagc tactccagac caccggaggc 3960 

atccaaggga cattgccact gactgtccag aaggtgttgg catcccaagc ctgccatggg 4020 

gccattaagt ttaatgatgg cctgagctta caggaaagtt gccgccttat tgaagctctg 4080 

tcctcatgcc agctgccatt ccagtgtgct cacgggagac cttctatgct gccgttagct 4140 
gacatagacc acttggaaca ggaaaaacag attaaaccca acctcactaa acttcgcaaa 42 00 
atggcccagg cctggcgtct ctttggaaaa gcagagtgtg atacaaggca gagcctgcag 4250 
caatccatgc ctccctgtga gccaccatga 4290 

<210> 29 

<211> 1429 

<212> PHT 

<213> Homo sapiens 

<400> 29 

Met lie Lys Cys Leu Ser Val Glu Val Gin Ala Lys Leu Arg Ser Gly 
1 5 10 15 
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Leu Ala lie Ser Ser Leu Gly Gin Cys Val Glu Glu Leu Ala Leu Asn 
20 25 30 

Ser He Asp Ala Glu Ala Lys Cys Val Ala Val Arg Val Asn Met Glu 
35 40 45 

Thr Phe Gin Val Gin Val lie Asp Asn Gly Phe Gly Met Gly Ser Asp 
50 55 60 

Asp Val Glu Lys Val Gly Asn Arg Tyr Phe Thr Ser Lys Cys His Ser 
65 70 75 80 

Val Gin Asp Leu Glu Asn Pro Arg Phe Tyr Gly Phe Arg Gly Glu Ala 


Leu Ala Asn lie Ala Asp Met Ala Ser Ala Val Glu He Ser Ser Lys 
100 105 110 

Lys Asn Arg Thr Met Lys Thr Phe Val Lys Leu Phe Gin Ser Gly Lys 
115 120 125 

Ala Leu Lys Ala Cys Glu Ala Asp Val Thr Arg Ala Ser Ala Gly Thr 
130 135 140 


Thr Val Thr Val Tyr Asn Leu Phe Tyr Gin Leu Pro Val Arg Arg Lys 
145 150 155 160 

Cys Met Asp Pro Arg Leu Glu Phe Glu Lys Val Arg Gin Arg He Glu 
165 170 ■ 175 

Ala Leu Ser Leu Met His Pro Ser He Ser Phe Ser Leu Arg Asn Asp 
180 185 190 

Val Ser Gly Ser Met Val Leu Gin Leu Pro Lys Thr Lys Asp Val Cys 
195 200 205 


Ser Arg Phe Cys Gin He Tyr Gly Leu Gly Lys Ser Gin Lys Leu Arg 
210 215 220 

Glu He ser Phe Lys Tyr Lys Glu Phe Glu Leu Ser Gly Tyr He Ser 
225 230 235 240 

Ser Glu Ala His Tyr Asn Lys Asn Met Gin Phe Leu Phe Val Asn Lys 
245 250 255 


Arg Leu Val Leu Arg Thr Lys Leu His Lys Leu He Asp Phe Leu Leu 
260 265 270 
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Gin Met Asn Ser Ser Leu Arg His Arg Ser Thr Pro Glu Leu Tyr Gly 
290 295 300 


He Tyr Val He Asn Val Gin Cys Gin Phe Cys Glu Tyr Asp Val Cys 
305 310 315 320 


Met Glu Pro Ala Lys Thr Leu He Glu Phe Gin Asn Trp Asp Thr Leu 
325 330 335 


Leu Phe Cys He Gin Glu Gly Val Lys Met Phe Leu Lys Gin Glu Lys 
340 345 350 


Leu Phe Val Glu Leu Ser Gly Glu Asp He Lys Glu Phe Ser Glu Asp 
355 360 365 


Asn Gly Phe Ser Leu Phe Asp Ala Thr Leu Gin Lys Arg Val Thr Ser 
370 375 380 


Asp Glu Arg Ser Asn Phe Gin Glu Ala Cys Asn Asn He Leu Asp Ser 
385 390 395 400 


Tyr Glu Met Phe Asn Leu Gin Ser Lys Ala Val Lys Arg Lys Thr Thr 
405 410 415 


Ala Glu 'Asn Val Asn Thr Gin Ser Ser Arg Asp Ser Glu Ala Thr Arg 
420 425 430 


Lys Asn Thr Asn Asp Ala Phe Leu Tyr He Tyr Glu Ser Gly Gly Pro 
435 440 445 


Gly His Ser Lys Met Thr Glu Pro Ser Leu Gin Asn Lys Asp Ser Ser 

450 455 460 


Cys Ser Glu Ser Lys Met Leu Glu Gin Glu Thr He Val Ala Ser Glu 
465 470 475 480 


Ala Gly Glu Asn Glu Lys His Lys Lys Ser Phe Leu Glu His Ser Ser 
485 490 495 


Leu Glu Asn Pro Cys Gly Thr Ser Leu Glu Met Phe Leu Ser Pro Phe 
500 505 510 


Gin Tht Pro Cys His Phe Glu Glu Ser Gly Gin Asp Leu Glu He Trp 
515 520 525 


Page 54 


wo 03/012130 


PCT/USOl/23888 


Asn Arg lie Gin Asn Gin Pro Lys Arg Eha Lys Asp Ala Thr Glu Val 
545 550 555 560 


Gly Cys Gin Pro Leu Pro Phe Ala Thr Thr Leu Trp Gly Val His Ser 
565 570 575 


Ala Gin Thr Glu Lys Glu Lys Lys Lys Glu Ser Ser Asn Cys Gly Arg 
580 585 590 


Arg Asn Val Phe Ser Tyr Gly Arg Val Lys Leu Cys Ser Thr Gly Phe 
595 600 605 


He Thr His Val Val Gin Asn Glu Lys Thr Lys Ser Thr Glu Thr Glu 
610 615 620 


His Ser Phe Lys Asn Tyr Val Arg Pro Gly Pro Thr Arg Ala Gin Glu 
625 630 635 640 


Thr Phe Gly Asn Arg Thr Arg His Ser Val Glu Thr Pro Asp He Lys 
645 650 655 


Lys Asn Cys Arg Thr Asn He Ser Tyr Gly Leu Glu Asn Glu Pro Thr 
675 680 585 


Ala Thr Tyr Thr Met Phe Ser Ala Phe Gin Glu Gly Ser Lys Lys Ser 
690 695 700 


Gin Thr Asp Cys He Leu Ser Asp Thr Ser Pro Ser Phe Pro Trp Tyr 
705 710 715 720 


Ser Leu Glu Lys Phe Lys Arg Gin Tyr Gly Lys Val Glu Asn Pro Leu 
755 760 765 


Asp Thr Glu Val Glu Glu Ser Asn Gly Val Thr Thr Asn Leu Ser Leu 
770 775 780 
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Gin Val Glu Pro Asp lie Leu Leu Lys Asp Lys Asn Arg Leu Glu Asn 
785 790 795 800 


Ser Asp Val Cys Lys lie Thr Thr Met Glu His Ser Asp Ser Asp Ser 
805 810 815 


Ser Cys Gin Pro Ala Ser His He Leu Asp Ser Glu Lys Phe Pro Phe 
820 825 830 


Ser Lys Asp Glu Asp Cys Leu Glu Gin Gin Met Leu Ser Leu Arg Glu 
835 840 845 


Ser Pro Met Thr Leu Lys Glu Leu Ser Leu Phe Asn Arg Lys Pro Leu 
850 855 860 


Asp Leu Glu Lys Ser Ser Glu Ser Leu Ala Ser Lys Leu Ser Arg Leu 
865 870 875 880 


Lys Gly Ser Glu Arg Glu Thr Gin Thr Met Gly Met Met Ser Arg Phe 
885 890 895 


Asn Glu Leu Pro Asn Ser Asp Ser Ser Arg Lys Asp Ser Lys Leu Cys 
900 905 910 


Lys Thr Glu Asn Gly Val lie Pro Thr Ser Asp Ser Ala Thr Gin Asp 
930 935 940 


Asn Ser Phe Asn Lys Asn Ser Lys Thr His Ser Asn Ser Asn Thr Thr 
945 950 955 960 


Glu Asn Cys Val He Ser Glu Thr Pro Leu Val Leu Pro Tyr Asn Asn 

965 970 975 


Ser Lys Val Thr Gly Lys Asp Ser Asp Val Leu He Arg Ala Ser Glu 
980 985 990 

Gin Gin He Gly Ser Leu Asp Ser Pro Ser Gly Met Leu Met Asn Pro 
995 1000 1005 


Val Glu Asp Ala Thr Gly Asp Gin Asn Gly He Cys Phe Gin Ser 
1010 1015 1020 


Glu Glu Ser Lys Ala Arg Ala Cys Ser Glu Thr Glu Glu Ser Asn 
1025 1030 1035 
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Thr Cys Cys Ser Asp Trp Gin Arg His Phe Asp Val Ala Leu Gly 
1040 1045 1050 


Arg Met Val Tyr Val Asn Lys Met Thr Gly Leu Ser Thr Phe He 
1055 1060 10S5 


Ala Pro Thr Glu Asp lie Gin Ala Ala Cys Thr Lys Asp Leu Thr 
1070 1075 1080 


Thr Val Ala Val Asp Val Val Leu Glu Asn Gly Ser Gin Tyr Arg 
1085 1090 1095 


Cys Gin Pro Phe Arg Ser Asp Leu Val Leu Pro Phe Leu Pro Arg 
1100 1105 1110 


Ala Arg Ala Glu Arg Thr Val Met Arg Gin Asp Asn Arg Asp Thr 
1115 1120 1125 


Val Asp Asp Thr Val Ser Ser Glu Ser Leu Gin Ser Leu Phe Ser 
1130 1135 1140 


Glu Trp Asp Asn Pro Val Phe Ala Arg Tyr Pro Glu Val Ala Val 
1145 1150 1155 


Asp Val Ser Ser Gly Gin Ala Glu Ser Leu Ala Val Lys He His 
1160 1165 1170 


Asn He Leu Tyr Pro Tyr Arg Phe Thr Lys Gly Met He His Ser 
1175 1180 1185 


Met Gin Val Leu Gin Gin Val Asp Asn Lys Phe He Ala Cys Leu 
1190 1195 1200 


Lys Gin Gin Ala Gin Gly Ser Gly Arg Lys Lys Leu Leu Ser Ser 
1220 1225 1230 


Thr Leu He Pro Pro Leu Glu He Thr Val Thr Glu Glu Gin Arg 
1235 1240 1245 


Arg Leu Leu Trp Cys Tyr His Lys Asn Leu Glu Asp Leu Gly Leu 
1250 1255 1260 


Glu Phe Val Phe Pro Asp Thr Ser Asp Ser Leu Val Leu Val Gly 
1265 1270 1275 
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Thr Leu Pro Leu Thr Val Gin Lys Val Leu Ala Ser Gin Ala Cys 
1325 1330 1335 


Cys Arg Leu lie Glu Ala Leu Ser Ser Cys Gin Leu Pro Phe Gin 
1355 1360 1365 


Cys Ala His Gly Arg Pro Ser Met Leu Pro Leu Ala Asp lie Asp 
1370 1375 1380 


Asp Thr Arg Gin Ser Leu Gin Gin Ser Met Pro Pro Cys Glu Pro 
1415 1420 1425 


<212> DMA 

<213> Arabidopsis thaliana 
<400> 30 

atgcaaggag attcttctcc gtctccgacg actactagct ctcctttgat aagacctata 60 

.aacagaaacg taattcacag aatctgttcc ggtcaagtca tcttagacct ctcttcggcc 120 

gtcaaggagc ttgtcgagaa tagtctcgac gccggcgcca ccagtataga gattaacctc 180 

cgagactacg gcgaagacta ttttcaggtc attgacaatg gttgtggcat ttccccaacc 240 

aatttcaagg tttgtgtcca aattctccga agaacttttg atgttcttgc acttaagcat 300 

catacttcta aattagagga tttcacagat cttttgaatt tgactactta tggttttaga 360 
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ggagaagcct tgagctctct ctgtgcattg ggaaatctca ctgtggaaac aagaacaaag 420 

aatgagccag ttgctacgct cttgacgttt gatcattctg gtttgcttac tgctgaaaag 480 

aagactgctc gccaaattgg taccactgtc actgttagga agttgttctc taatttacct 540 

gtacgaagca aagagtttaa gcggaatata cgcaaagaat atgggaagct tgtatcttta SOO 

ttgaacgcat atgcgcttat tgcgaaagga gtgcggtttg tctgctctaa cacgactggg 660 

aaaaacccaa agtctgttgt gctgaacaca caagggaggg gttcacttaa agataatatc 720 

ataacagttt tcggcattag tacctttaca agtctacagc ctggtactgg acgcaattta 780 

gcagatcgac agtatttctt tataaatggt cggcctgtag atatgccaaa agtcagcaag 840 

ttggtgaatg agttatataa agatacaagt tctcggaaat atccagttac cattctggat 900 

tttattgtgc ctggtggagc atgtgatttg aatgtcacgc ccgataaaag aaaggtgttc 960 

ttttctgacg agacttctgt taCcggttct ttgagggaag gtctgaacga gatatattcc 1020 

tccagtaatg cgtcttatat tgttaatagg ttcgaggaga attcggagca accagataag 1080 

gctggagttt cgtcgtttca gaagaaatca aatcttttgt cagaagggat agttctggat 1140 

gtcagttcta aaacaagact aggggaagct attgagaaag aaaatccatc cttaagggag 1200 

gttgaaattg ataatagttc gccaatggag aagtttaagt ttgagatcaa ggcatgtggg 1260 

acgaagaaag gggaaggttc tttatcagtc catgatgtaa cbcaccttga caagacacct 132 0 

agcaaaggtt tgcctcagtt aaatgtgact gagaaagtta ctgatgcaag taaagacttg 136 0 

agcagccgct ctagctttgc ccagtcaact ttgaatactt ttgttaccat gggaaaaaga 144 0 

aaacatgaaa acataagcac catcctctct gaaacacctg tcctcagaaa ccaaacttct 1500 

agttatcgtg tggagaaaag caaatttgaa gttcgtgcct tagcttcaag gtgtctcgtg 1560 

gaaggcgatc aacttgatga tatggtcatc tcaaaggaag atatgacacc aagcgaaaga 1620 

gattctgaac taggcaatcg gatttctcct ggaacacaag ctgataatgt tgaaagacat 1680 

gagagagtac tcgggcaatt caatcttggg ttcatcattg caaaattgga gcgagatctg 1740 

ttcattgtgg atcagcatgc agctgatgag aaattcaact tcgaacattt agcaaggtca 1800 

actgtcctga accagcaacc cttactccag cctttgaact tggaactctc tccagaagaa 1860 

gaagtaactg tgttaatgca catggatatt atcagggaaa atggctttct tctagaggag 1920 

aatccaagtg ctcctcccgg aaaacacttt agactacgag ccattcctta tagcaagaat 1980 

atcacctttg gagtcgaaga tcttaaagac ctgatctcaa ctctaggaga taaccatggg 2040 

gaatgttcgg ttgctagtag ctacaaaacc agcaaaacag attcgatttg tccatcacga 2100 

gtccgtgcaa tgctagcatc ccgagcatgc agatcatctg tgatgatcgg agatccactc 2160 

agaaaaaacg aaatgcagaa gatagtagaa cacttggcag atctcgaatc tccttggaat 2220 

tgcccacacg gacgaocaac aatgcgtcat cttgtggact tgacaacttt actcacatta 2280 
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cctgatgacg acaatgtcaa tgatgatgat gatgatgatg caaccatctc attggcatga 2340 

<210> 31 
<211> 779 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 31 

Met Gin Gly Asp Ser Ser Pro Ser Pro Thr Thr Thr Ser Ser Pro Leu 


lie Arg Pro lie Asn Arg Asn Val He His Arg He Cys Ser Gly Gin 

20 25 30 

Val He Leu Asp Leu Ser Ser Ala Val Lys Glu Leu Val Glu Asn Ser 
35 40 45 

Leu Asp Ala Gly Ala Thr Ser He Glu He Asn Leu Arg Asp Tyr Gly 
50 55 60 

Glu Asp Tyr Phe Gin Val He Asp Asn Gly Cys Gly He Ser Pro Thr 


Asn Phe Lys Val Cys Val Gin He Leu Arg Arg Thr Phe Asp Val Leu 
85 90 95 

Ala Leu Lys His His Thr Ser Lys Leu Glu Asp Phe Thr Asp Leu Leu 
100 105 110 


Asn Leu "Ehr Thr Tyr Gly Phe Arg Gly Glu Ala Leu Ser Ser Leu Cys 
115 120 125 


Ala Leu Gly Asn Leu Thr Val Glu Thr Arg Thr Lys Asn Glu Pro Val 
130 135 140 


Ala Thr Leu Leu Thr Phe Asp His Ser Gly Leu Leu Thr Ala Glu Lys 

145 150 155 160 

Lys Thr Ala Arg Gin He Gly Thr Thr Val Thr Val Arg Lys Leu Phe 
165 170 175 


Ser Asn Leu Pro Val Arg Ser Lys Glu Phe Lys Arg Asn He Arg Lys 
180 185 190 


Glu Tyr Gly Lys Leu Val Ser Leu Leu Asn Ala Tyr Ala Leu He Ala 
195 200 205 


Lys Gly Val Arg Phe Val Cys Ser Asn Thr Thr Gly Lys Asn Pro Lys 
210 215 220 
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Ser Val Val Leu Asn Thr. Gin Gly Arg Gly Ser Leu Lys Asp Asn lie 
225 230 235 240 


Gly Arg Asn Leu Ala Asp Arg Gin Tyr Phe Phe lie Asn Gly Arg Pro 
260 265 270 


Val Asp Met Pro Lys Val Ser Lys Leu Val Asn Glu Leu Tyr Lys Asp 
275 280 285 


Thr Ser Ser Arg Lys Tyr Pro Val Thr lie Leu Asp Phe lie Val Pro 
290 295 300 


Gly Gly Ala Cys Asp Leu Asn Val Thr Pro Asp Lys Arg Lys Val Phe 
305 310 315 320 


Phe Ser Asp Glu Thr Ser Val lie Gly Ser Leu Arg Glu Gly Leu Asn 
325 330 335 


Glu lie Tyr Ser Ser Ser Asn Ala Ser Tyr lie Val Asn Arg Phe Glu 
340 345 350 


Glu Asn Ser Glu Gin Pro Asp Lys Ala Gly Val Ser Ser Phe Gin Lys 
355 350 365 


Lys Ser Asn Leu Leu Ser Glu Gly lie Val Leu Asp Val Ser Ser Lys 
370 375 380 


Thr Arg Leu Gly Glu Ala lie Glu Lys Glu Asn Pro Ser Leu Arg Glu 
385 390 395 400 


Val Glu lie Asp Asn Ser Ser Pro Met Glu Lys Phe Lys Phe Glu He 
405 410 415 


Lys Ala Cys Gly Thr Lys Lys Gly Glu Gly Ser Leu Ser Val His Asp 
420 425 430 


Val Thr His Leu Asp Lys Thr Pro Ser Lys Gly Leu Pro Gin Leu Asn 
435 440 445 


Val Thr Glu Lys Val Thr Asp Ala Ser Lys Asp Leu Ser Ser Arg Ser 
450 455 460 


Ser Phe Ala Gin Ser Thr Leu Asn Thr Phe Val Thr Met Gly Lys Arg 
465 470 475 480 
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Lys His Glu Asn He Ser Thr He Leu Ser Glu Thr Pro Val Leu Arg 
485 490 495 


Asn Gin Thr Ser Ser Tyr Arg Val Glu Lys Ser Lys Phe Glu Val Arg 
500 505 510 


Ala Leu Ala Ser Arg Cys Leu Val Glu Gly Asp Gin Leu Asp Asp Met 
515 520 525 


Val He Ser Lys Glu Asp Met Thr Pro Ser Glu Arg Asp Ser Glu Leu 
530 535 540 


Gly Asn Arg He Ser Pro Gly Thr Gin Ala Asp Asn Val Glu Arg His 
545 550 555 560 


Glu Arg Val Leu Gly Gin Phe Asn Leu Gly Phe He He Ala Lys Leu 
565 570 575 


Glu Arg Asp Leu Phe He Val Asp Gin His Ala Ala Asp Glu Lys Phe 
580 585 590 


Asn Phe Glu His Leu Ala Arg Ser Thr Val Leu Asn Gin Gin Pro Leu 
595 600 605 


Leu Gin pro Leu Asn Leu Glu Leu Ser Pro Glu Glu Glu Val Thr Val 
610 615 620 


Leu Met His Met Asp He He Arg Glu Asn Gly Phe Leu Leu Glu Glu 
625 630 635 640 


I Pro Ser Ala Pro Pro Gly Lys His Phe Arg Leu Arg Ala He Pro 
S45 650 655 


Tyr Ser Lys Asn He Thr Phe Gly Val Glu Asp Leu Lys Asp Leu He 

660 665 670 


Ser Thr Leu Gly Asp Asn His Gly Glu Cys Ser Val Ala Ser Ser Tyr 
675 680 685 


Lys Thr Ser Lys Thr Asp Ser He Cys Pro Ser Arg Val Arg Ala Met 
690 695 700 


Leu Ala Ser Arg Ala Cys Arg Ser Ser Val Met He Gly Asp Pro Leu 
705 710 715 720 


Arg Lys Asn Glu Met 61n Lys He Val Glu His Leu Ala Asp Leu Glu 
725 730 735 
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Ser Pro Trp Asn Cys Pro His Gly Arg Pro Thr Met Arg His Leu Val 
740 745 750 

Asp Leu Thr Thr Leu Leu Thr Leu Pro Asp Asp Asp Asn Val Asn Asp 
755 760 , 765 

Asp Asp Asp Asp Asp Ala Thr lie Ser Leu Ala 
770 775 

<210> 32 
<211> 3456 
<212> DMA 

<213> Arabidopsis thaliana 
<400> 32 

atgaagacga tcaagccctt gccggaagga gttcgtcact ccatgcgttc tggaattatc 60 

atgttcgaca tggcgagggt cgtggaagaa ctcgtcttca acagtctcga tgctggggcg 120 

accaaggtgt ctatcttcgt gggtgttgtt tcatgctctg tgaaagttgt ggatgatgga 180 

tcaggcgttt caagagatga tttggttttg ttgggagaaa gatatgctac ttcaaagttt 240 

cacgacttca ccaacgtgga gacagctagt gaaacttttg gatttcgtgg agaggcctta 300 

gcttcaatat cagatatctc gttactggag gttaggacaa aagctattgg gaggcctaat 360 

ggttatcgaa aggttatgaa gggatccaag tgtctacatc ttggaattga tgatgataga 42 0 

aaagactctg gcacgacggt aactgtccga gatctatttt acagtcagcc agtgagacga 480 

aaatatatgc aatccagccc caagaaagtt ttggaatcta tcaaaaagtg tgtgttccgg 540 

attgcccttg tgcactccaa tgtttccttc agtgttcttg atatcgaaag tgatgaagag 600 

cttttccaaa ccaatccttc ttcttcagca ttctcactac tgatgagaga tgcagggacc 660 

gaagctgtaa attcgctttg taaagtaaac gttacagatg gcatgctgaa tgtctctggt 720 

tttgagtgtg cggatgactg gaagcctacg gatgggcaac aaacaggaag acgcaataga 780 

cttcaatcca accctggtta cattctgtgc atagcatgtc cacgccgtct ttatgaattc 840 

tcgtttgaac catcaaagac gcacgttgag ttcaagaagt ggggacctgt acttgccttt 900 

atagaaagaa tcactctagc caactggaag aaagatagaa ttcttgaact ttttgatggg 960 

ggagctgata tactggcaaa aggtgataga caagacctga ttgatgacaa aattagactt 1020 

caaaacggca gccttttctc aattcttcat tttctggatg cagattggcc agaagctatg 1080 

gaacctgcaa aaaagaagct gaagagaagt aatgatcatg caccttgtag ttctctcttg 1140 

tttccgtctg ctgactttaa acaagatggt gattattttt ctccacgaaa ggatgtatgg 1200 

tctccagaat gtgaagtcga actgaaaatt cagaatccca aagagcaagg tactgtagct 1260 

ggatttgaaa gccggactga ttctcttcta cagtcacgtg acatagaaat gcaaacgaat 1320 

Page 63 

BNSDOCID: <WO 03012130A1J_> 


wo 03/012130 


PCT/USOl/23888 


gaagacttcc cacaagttac tgacctcctt gaaacaagct tggttgctga ctctaagtgc 1380 

cgtaaacagt ttctaacaag atgtcagatt accacacctg tcaatatcaa ccatgatttt 1440 

atgaaagatt cagacgtgtt aaattttcag tttcaaggat tgaaagatga gttggatgtc 1500 

agcaattgca ttggaaagca tctcttgcgt ggttgctctt caagagtaag cctaaccttt 1560 

catgagccta aactatctca tgttgaaggg tatgaatccg tcgtgcctat gatacctaat 162 0 

gaaaaacaaa gtagtccgcg ggtcctagag accagagaag gtggttcgta ctgtgatgtt 168 0 

tattctgata agactcctga ttgttcccta gggagttcat ggcaggatac tgattggttt 1740 

actccacagt gttcctcaga taggggatgt gttggaattg gagaagattt taacattacc 1800 

cccatagata ctgcggaatt tgattcttat gatgaaaaag ttggtagtaa aaagtatctt 1860 

tcttctgtca atgtggggag ctctgttact ggtagtttct gtttaagttc tgagtggtct 1920 

ccaatgtact ccacaccttc tgcgaccaag tgggagtctg agtaccagaa aggttgtcga 1980 

attcttgaac agagtttgag actgggaagg atgcctgacc ctgaattttg tttcagtgca 2040 

gctaacaaca tcaaatttga ccacgaggtc atacctgaaa tggattgctg tgaaaccggt 2100 

acagactctt tcacagctat tcagaactgc actcagttag ctgataaaat ttgcaagtct 2160 

tcgtgggggc atgcagatga tgtgcgtatt gaccaatata gtatcaggaa ggaaaagttc 2220 

agttatatgg atggcacaca gaacaatgct ggtaaacaaa ggtcaaaaag aagtcgatct 2280 

gctcctccat tttatcgaga gaagaagaga tttatcagct taagttgtaa atcagacaca 2340 

aaaccaaaga actctgatcc atcagaacct gatgatctgg agtgtttgac acaaccttgt 2400 

aatgcatctc aaatgcatct taagtgcagc atccttgatg atgtgtcgta tgaccacata 2460 

caagaaacag aaaaaagatt gagttctgcc tcagacttga aagcatctgc tggttgcagg 2520 

actgtgcact cagagaccca agatgaggat gtgcacgaag acttcagctc agaggaattt 2580 

ctggatccaa ttaaatccac aacaaaatgg cgccataact gtgcggtctc tcaggttccc 2640 

aaggaatcac acgagcttca tggtcaagat ggtgtatttg atatatcttc gggacttctg 2700 

cacttacgat ccgatgaatc cttggttcct gaatctatca acagacactc ccttgaagat 2760 

gccaaggttc tacaacaggt tgataaaaaa tatatcccaa tcgttgchtg tggaacagtt 2820 

gccatcgttg atcagcatgc tgccgatgaa agaattcgtt tggaagagct gcgtacaaag 2880 

tttattaatg atgcattatt aatttttgtg ttgacattaa aggtactgcc ggagatgggt 2940 

tatcagttac tccagagtta ttcagagcag ataagagact ggggttggat ctgcaacatt 3000 

actgtagaag ggtcaacgtc ctttaagaaa aacatgagca tcatccagcg gaaaccaaca 3060 

ccaatcacac ttaatgcggt tccatgcatt ctgggtgtaa atctatcaga tgttgatcta 3120 

ttagagtttc ttcagcagct tgctgatact gacggatcat caactattcc tccatctgtt 3180 

cttcgagtcc taaattccaa agcctgtaga ggtgcaatta tgtttggaga tagtctgtta 3240 
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ccgtcagaat gctctttaat cattgatgga ctgaagcaga cctcactttg tttccagtgt 33 00 

gctcatgggc gacctacaac agttcctctt gtcgatttga aggcattgca caaacagata 3360 

gcaaagctca gtggaagaca agtgtggcat ggcttacaac gcagagaaat tacacttgat 3420 

cgtgcaaaat cacgcttaga caacgctaaa agttaa 3456 

<210> 33 
<211> 1151 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 33 

Met Lys Thr lie Lys Pro Leu Pro Glu Gly Val Arg His Ser Met Arg 
15 10 15 

Ser Gly lie He Met Phe Asp Met Ala Arg Val Val Glu Glu Leu Val 
20 25 30 

Phe Asn Ser Leu Asp Ala Gly Ala Thr Lys Val Ser He Phe Val Gly 
35 40 45 

Val Val Ser Cys Ser Val Lys Val Val Asp -Asp Gly Ser Gly Val Ser 
50 55 60 

Arg Asp Asp Leu Val Leu Leu Gly Glu Arg Tyr Ala Thr Ser Lys Phe 


His Asp Phe Thr Asn Val Glu Thr Ala Ser Glu Thr Phe Gly Phe Arg 
85 90 95 


Gly Glu Ala Leu Ala Ser He Ser Asp lie Ser Leu Leu Glu Val Arg 
100 105 110 


Thr Lys Ala He Gly Arg Pro Asn Gly Tyr Arg Lys Val Met Lys Gly 
115 120 125 


Ser Lys Cys Leu His Leu Gly He Asp Asp Asp Arg Lys Asp Ser Gly 
130 135 140 


Thr Thr Val Thr Val Arg Asp Leu Phe Tyr Ser Gin Pro Val Arg Arg 
145 150 155 160 


Lys Tyr Met Gin Ser Ser Pro Lys Lys Val Leu Glu Ser He Lys Lys 
165 170 175 


Cys Val Phe Arg He Ala Leu Val His Ser Asn Val Ser Phe Ser Val 
180 185 190 
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Leu Asp lie Glu Ser Asp Glu Glu Leu Phe Gin Thr Asn Pro Ser Ser 
195 200 205 


Ser Ala Phe Ser Leu Leu Met Arg Asp Ala Gly Thr Glu Ala Val Asn 
210 215 220 


Ser Leu Cys Lys Val Asn Val Thr Asp Gly Met Leu Asn Val Ser Gly 
225 230 235 240 


Phe Glu Cys Ala Asp Asp Trp Lys Pro Thr Asp Gly Gin Gin Thr Gly 
245 250 255 


Arg Arg Asn Arg Leu Gin Ser Asn Pro Gly Tyr lie Leu Cys He Ala 
260 2S5 270 


Cys Pro Arg Arg Leu Tyr Glu Phe Ser Phe Glu Pro Ser Lys Thr His 
275 280 285 


Val Glu Phe Lys Lys Trp Gly Pro Val Leu Ala Phe lie Glu Arg He 
290 295 300 


Thr Leu Ala Asn Trp Lys Lys Asp Arg He Leu Glu Leu Phe Asp Gly 
305 310 315 320 


Gly Ala Asp He Leu Ala Lys Gly Asp Arg Gin Asp Leu He Asp Asp 

325 330 335 


Lys He Arg Leu Gin Asn Gly Ser Leu Phe Ser He Leu His Phe Leu 
340 345 350 


Asp Ala Asp Trp Pro Glu Ala Met Glu Pro Ala Lys Lys Lys Leu Lys 
355 350 365 


Arg Ser Asn Asp His Ala Pro Cys Ser Ser Leu Leu Phe Pro Ser Ala 
370 375 380 


Asp Phe Lys Gin Asp Gly Asp Tyr Phe Ser Pro Arg Lys Asp Val Trp 
3B5 390 395 400 


Ser Pro Glu Cys Glu Val Glu Leu Lys He Gin Asn Pro Lys Glu Gin 
405 410 415 


Gly Thr Val Ala Gly Phe Glu Ser Arg Thr Asp Ser Leu Leu Gin Ser 
420 425 430 


Arg Asp He Glu Met Gin Thr Asn Glu Asp Phe Pro Gin Val Thr Asp 
435 440 445 


Page 66 


wo 03/012130 


PCT/USOl/23888 


Leu Leu Glu Thr Ser Leu Val Ala Asp Ser Lys Cys Arg Lys Gin Phe 
450 455 460 


Leu Thr Arg Cys Gin lie Thr Thr Pro Val Asn lie Asn His Asp Phe 
465 470 475 480 


Glu Leu Asp Val Ser Asn Cys He Gly Lys His Leu Leu Arg Gly Cys 
500 505 510 


Ser Ser Arg Val Ser Leu Thr Phe His Glu Pro Lys Leu Ser His Val 
515 520 525 


Thr Asp Trp Phe Thr Pro Gin Cys Ser Ser Asp Arg Gly Cys Val Gly 
580 585 590 


Pro Met Tyr Ser Thr Pro Ser Ala Thr Lys Trp Glu Ser Glu Tyr Gin 
645 650 655 


Lys Gly Cys Arg He Leu Glu Gin Ser Leu Arg Leu Gly Arg Met Pro 
660 665 670 


Asp Pro Glu Phe Cys Phe Ser Ala Ala Asn Asn He Lys Phe Asp His 
675 680 685 
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Thr AJa lie Gin Asn Cys Thr Gin Leu Ala Asp Lys He Cys Lys Ser 
705 710 715 720 


Ser Tnrp Gly His Ala Asp Asp Val Arg He Asp Gin Tyr Ser He Arg 
725 730 735 


Lys GJu Lys Phe Ser Tyr Met Asp Gly Thr Gin Asn Asn Ala Gly Lys 
740 745 750 


Gin Arg Ser Lys Arg Ser Arg Ser Ala Pro Pro Phe Tyr Arg Glu Lys 
755 760 755 


Lys Axg Phe He Ser Leu Ser Cys Lys- Ser Asp Thr Lys Pro Lys Asn 
770 775 780 


Ser Asp Pro Ser Glu Pro Asp Asp Leu Glu Cys Leu Thr Gin Pro Cys 
785 790 795 800 


Asn Ala Ser Gin Met His Leu Lys Cys Ser He Leu Asp Asp Val Ser 
805 810 315 


Tyr Asp His He Gin Glu Thr Glu Lys Arg Leu Ser Ser Ala Ser Asp 
820 825 830 


Leu Lys Ala Ser Ala Gly Cys Arg Thr Val His Ser Glu Thr Gin Asp 
835 840 845 


Glu Asp Val His Glu Asp Phe Ser Ser Glu Glu Phe Leu Asp Pro He 
850 355 860 


Lys Ser Thr Thr Lys Trp Arg His Asn Cys Ala Val Ser Gin Val Pro 
865 870 875 880 


Lys Glu Ser His Glu Leu His Gly Gin Asp Gly Val Phe Asp He Ser 
885 890 895 


Ser Gly Leu Leu His Leu Arg Ser Asp Glu Ser Leu Val Pro Glu Ser 
900 905 910 


He Asn Arg His Ser Leu Glu Asp Ala Lys Val Leu Gin Gin Val Asp 
915 920 925 


Lys ■ Lys Tyr He Pro He Val Ala Cys Gly Thr Val Ala He Val Asp 

930 935 940 


Gin His Ala Ala Asp Glu Arg He Arg Leu Glu Glu Leu Arg Thr Lys 
945 950 955 960 
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Phe He Asn Asp Ala Leu Leu He Phe Val Leu Thr Leu Lys Val Leu 
965 970 975 


Pro Glu Met Gly Tyr Gin Leu Leu Gin Ser Tyr Ser Glu Gin He Arg 
980 985 990 

Asp Trp Gly Trp He Cys Asn He Thr Val Glu Gly Ser Thr Ser Phe 
995 1000 1005 

Lys Lys Asn Met Ser He He Qln Arg Lys Pro Thr Pro He Thr 
1010 1015 1020 

Leu Asn Ala Val Pro Cys He Leu Gly Val Asn Leu Ser Asp Val 
1025 1030 1035 

Asp Leu Leu Glu Phe Leu Gin Gin Leu Ala Asp Thr Asp Gly Ser 
1040 1045 1050 

Ser Thr He Pro Pro Ser Val Leu Arg Val Leu Asn Ser Lys Ala 
1055 1060 1065 

Cys Arg Gly Ala He Met Phe Gly Asp Ser Leu Leu Pro Ser Glu 
1070 1075 1080 

Cys Ser Leu He He Asp Gly Leu Lys Gin Thr Ser Leu Cys Phe 
1085 1090 1095 


Gin Cys Ala His Gly Arg Pro Thr Thr Val Pro Leu Val Asp Leu 
1100 1105 1110 


Lys Ala Leu His Lys Gin He Ala Lys Leu Ser Gly Arg Gin Val 
1115 1120 1125 


Trp His Gly Leu Gin Arg Arg Glu He Thr Leu Asp Arg Ala Lys 
1130 1135 1140 


Ser Arg Leu Asp Asn Ala Lys Ser 
1145 1150 


<210> 34 

<211> 3330 

<212> DNA 

<213> Arabidopsis thaliana 


<400> 34 

atgcagcgcc agagatcgat tttgtctttc ttccaaaaac ccacggcggc gactacgaag 60 

ggtttggttt ccggcgatgc tgctagcggc gggggcggca gcggaggacc acgatttaat 120 

gtgaaggaag gggatgctaa aggcgacgct tctgtacgtt ttgctgtttc gaaatctgtc 180 
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gatgaggtta gaggaacgga tactccaccg gagaaggttc cgcgtcgtgt cctgccgtct 240 

ggatttaagc cggctgaatc cgccggtgat gcttcgtccc tgttctccaa tattatgcat 300 

aagtttgtaa aagtcgatga tcgagattgt tctggagaga ggagccgaga agatgttgtt 360 

ccgctgaatg attcatctct a'tgtatgaag gctaatgatg ttattcctca atttcgttcc 420 

aataatggta aaactcaaga aagaaaccat gcttttagtt tcagtgggag agctgaactt 480 

agatcagtag aagatatagg agtagatggc gatgttcctg gtccagaaac accagggatg 540 

cgtccacgtg cttctcgctt gaagcgagtt ctggaggatg aaatgacttt taaggaggat 600 

aaggttcctg tattggactc taacaaaagg ctgaaaatgc tccaggatcc ggtttgtgga 660 

gagaagaaag aagtaaacga aggaaccaaa tttgaatggc ttgagtcttc tcgaatcagg 720 

gatgccaata gaagacgtcc tgatgatccc ctttacgata gaaagacctt acacatacca 780 

cctgatgttt tcaagaaaat gtctgcatca caaaagcaat attggagtgt taagagtgaa 840 

tatatggaca ttgtgctttt ctttaaagtg gggaaatttt atgagctgta tgagctagat 900 

gcggaattag gtcacaagga gcttgactgg aagafcgacca tgagtggtgt gggaaaatgc 960 

agacaggttg gtatctctga aagtgggata gatgaggcag tgcaaaagct attagctcgt 1020 

ggatataaag ttggacgaat cgagcagcta gaaaaatctg accaagcaaa agccagaggt 1080 

gctaatacta taattccaag gaagctagtt caggtattaa ctccatcaac agcaagcgag 1140 

ggaaacatcg ggcctgatgc cgtccatctt cttgctataa aagagatcaa aatggagcta 1200 

caaaagtgtt caactgtgta tggatttgct tttgttgact gtgctgcctt gaggttttgg 1260 

gttgggtcca tcagcgatga tgcatcatgt gctgctcttg gagcgttatt gatgcaggtt 1320 

tctccaaagg aagtgttata tgacagtaaa gggctatcaa gagaagcaca aaaggctcta 1380 

aggaaatata cgttgacagg gtctacggcg gtacagttgg ctccagtacc acaagtaatg 1440 

ggggatacag atgctgctgg agttagaaat ataatagaat ctaacggata ctttaaaggt 1500 

tcttctgaat catggaactg tgctgttgat ggtctaaatg aatgtgatgt tgcccttagt 1560 

gctcttggag agctaattaa tcatctgtct aggctaaagc tagaagatgt acttaagcat 1620 

ggggatattt ttccatacca agtttacagg ggttgtctca gaattgatgg ccagacgatg 1680 

gtaaatcttg agatatttaa caatagctgt gatggtggtc cttcagggac cttgtacaaa 1740 

tatcttgata actgtgttag tccaactggt aagcgactct taaggaattg gatctgccat 1800 

ccactcaaag atgtagaaag catcaataaa cggcttgatg tagttgaaga attcacggca 1860 

aactcagaaa gtatgcaaat cactggccag tatctccaca aacttccaga cttagaaaga 1920 

ctgctcggac gcatcaagtc tagcgttcga tcatcagcct ctgtgttgcc tgctcttctg 1980 

gggaaaaaag tgctgaaaca acgagttaaa gcatttgggc aaattgtgaa agggttcaga 2040 

agtggaattg atctgttgtt ggctctacag aaggaatcaa atatgatgag tttgctttat 2100 
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aaactctgta 

aacttcctat 

attagtagga aaaagcgggc tagagttatt tctttctcaa 

2160 

ttcgaagcag 

ccatagatag 

cgactttcca aattatcaga accaagatgt gacagatgaa 

2220 

aacgctgaaa 

ctctcacaat 

acttatcgaa ctttttatcg aaagagcaac tcaatggtct 

2280 

gaggtcattc 

acaccataag 

ctgcctagat gtcctgagat cttttgcaat cgcagcaagt 

2340 

ctctctgctg gaagcatggc caggcctgfct atttttcccg aatcagaaga tacagatcag 

2400 

aatcagaaaa 

caaaagggcc 

aatacttaaa atccaaggac tatggcatcc atttgcagtt 

2460 

gcagccgatg 

gtcaattgcG 

tgttccgaat gatatactcc ttggcgaggc tagaagaagc 

2520 

agtggcagca 

ttcatcctcg 

gtcattgtta ctgacgggac caaacatggg cggaaaatca 

2580 

actcttcttc 

gtgcaacatg 

tctggccgtt atctttgccc aacttggctg ctacgtgccg 

2640 

tgtgagtctt gcgaaatctc 

cctcgtggat actatcttca caaggcttgg cgcatctgat 

2700 

agaatcatga 

caggagagag 

tacctttttg gtagaatgca ctgagacagc gtcagttctt 

2760 

cagaatgcaa 

ctcaggattc 

actagtaatc cttgacgaac tgggcagagg aactagtact 

2820 

ttcgatggat 

acgccattgc 


2880 

cggatgctct 

ttgcaacaca 

ttaccaccct ctcaccaagg aattcgcgtc tcacccacgt 

2940 

gtcacctcga 

aacacatggc 

ttgcgcattc aaatcaagat ctgattatca accacgtggt 

3000 

tgtgatcaag acctagtgtt 

cttgtaccgt ttaaccgagg gagcttgtcc tgagagctac 

3060 

ggacttcaag 

tggcactcat 

ggctggaata ccaaaccaag tggttgaaac agcatcaggt 

3120 

gctgctcaag 

ccatgaagag 

atcaattggg gaaaacttca agtcaagtga gctaagatct 


gagttctcaa 

gtctgcatga 

agactggctc aagtcattgg tgggtatttc tcgagtcgcc 

3240 

cacaacaatg 

cccccattgg 

cgaagatgac tacgacactt tgttttgctt atggcatgag 

3300 

atcaaatcct 

cttactgtgt 

tcccaaataa 

3330 


<210> 35 
<211> 1109 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 35 

Met Gin Arg Gin Arg Ser lie Leu Ser Phe Phe Gin Lys Pro Thr Ala 
15 10 15 

Ala Thr Thr Lys Gly Leu Val Ser Gly Asp Ala Ala Ser Gly Gly Gly 
20 25 30 

Gly Ser Gly Gly Pro Arg Phe Asn Val Lys Glu Gly Asp Ala Lys Gly 
35 40 45 

Asp Ala Ser Val Arg Phe Ala Val Ser Lys Ser Val Asp Glu Val Arg 
50 55 60 
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Gly Thr Asp Thr Pro Pro Glu Lys Val Pro Arg Arg Val Leu Pro Ser 


Gly Phe Lys Pro Ala Glu Ser Ala Gly Asp Ala Ser Ser Leu Phe Ser 


Glu Arg Ser Arg Glu Asp Val Val Pro Leu Asn Asp Ser Ser Leu Cys 
115 120 125 


Arg Ser Val Glu Asp lie Gly Val Asp Gly Asp Val Pro Gly Pro Glu 
165 170 175 


Thr Pro Gly Met Arg Pro Arg Ala Ser Arg Leu Lys Arg Val Leu Glu 
180 135 190 


Lys Arg Leu Lys Met Leu Gin Asp Pro Val Cys Gly Glu Lys Lys Glu 
210 215 220 


Asp Ala Asn Arg Arg Arg Pro Asp Asp Pro Leu Tyr Asp Arg Lys Thr 
245 250 255 


Lys Val Gly Lys Phe Tyr Glu Leu Tyr Glu Leu Asp Ala Glu Leu Gly 
290 295 300 
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Arg Gin Val Gly lie Ser Glu Ser Gly lie Asp Glu Ala Val Gin Lys 
325 330 335 


Leu Leu Ala Arg Gly Tyr Lys Val Gly Arg lie Glu Gin Leu Glu Thr 
340 345 350 


Ser Asp Gin Ala Lys Ala Arg Gly Ala Asn Thr lie He Pro Arg Lys 

355 360 365 


Leu Val Gin Val Leu Thr Pro Ser Thr Ala Ser Glu Gly Asn He Gly 
370 375 380 


Pro Asp Ala Val His Leu Leu Ala lie Lys Glu He Lys Met Glu Leu 
385 390 395 400 


Gin Lys Cys Ser Thr Val Tyr Gly Phe Ala Phe Val Asp Cys Ala Ala 
405 410 415 


Leu Arg Phe Trp Val Gly Ser He Ser Asp Asp Ala Ser Cys Ala Ala 
420 425 430 


Leu Thr Gly Ser Thr Ala Val Gin Leu Ala Pro Val Pro Gin Val Met 
465 470 475 480 


Gly Asp Thr Asp Ala Ala Gly Val Arg Asn He He Glu Ser Asn Gly 
485 490 495 


Tyr Phe Lys Gly Ser Ser Glu Ser Trp Asn Cys Ala Val Asp Gly Leu 

500 505 510 


Asn Glu Cys Asp Val Ala Leu Ser Ala Leu Gly Glu Leu He Asn His 
515 520 525 


Leu Ser Arg Leu Lys Leu Glu Asp Val Leu Lys His Gly Asp He Phe 
530 535 540 


Pro Tyr Gin Val Tyr Arg Gly Cys Leu Arg He Asp Gly Gin Thr IVIet 
545 550 555 560 


Val Asn Leu Glu He Phe Asn Asn Ser Cys Asp Gly Gly Pro Ser Gly 
5S5 570 575 
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Leu Leu Arg Asn Trp lie Cys His Pro Leu Lys Asp Val Glu Ser lie 
595 600 605 


Met Gin. He Thr Gly Gin Tyr Leu His Lys Leu Pro Asp Leu Glu Arg 
625 630 635 640 


Gly Gin He Val Lys Gly Phe Arg Ser Gly He Asp Leu Leu Leu Ala 
675 680 685 


Phe Glu Ala Ala He Asp Ser Asp Phe Pro Asn Tyr Gin Asn Gin Asp 
725 730 735 


He Glu Arg Ala Thr Gin Trp Ser' Glu Val He His Thr He Ser Cys 

755 760 765 


Ser Met Ala Arg Pro Val He Phe Pro Glu Ser Glu Ala Thr Asp Gin 
785 790 795 800 


Asn Gin Lys Thr Lys Gly Pro He Leu Lys He Gin Gly Leu Trp His 
805 810 815 


Pro Phe Ala Val Ala Ala Asp Gly Gin Leu Pro Val Pro Asn Asp He 
820 825 830 
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Leu Leu Leu Thr Gly Pro Asn Met Gly Gly Lys Ser Thr Leu Leu Arg 
850 855 860 


Gly Ala Ser Asp Arg He Met Thr Gly Glu Ser Thr Phe Leu Val Glu 
900 905 910 


Cys Thr Glu Thr Ala Ser Val Leu Gin Asn Ala Thr Gin Asp Ser Leu 
915 920 925 


Ala He Ala Tyr Ser Val Phe Arg His Leu Val Glu Lys Val Gin Cys 
945 950 955 960 


Arg Met Leu Phe Ala Thr His Tyr His Pro Leu Thr Lys Glu Phe Ala 
965 970 975 


Arg Ser Asp Tyr Gin Pro Arg Gly Cys Asp Gin Asp Leu Val Phe Leu 
995 1000 1005 


Tyr Arg Leu Thr Glu Gly Ala Cys Pro Glu Ser Tyr Gly Leu Gin 
1010 ■• 1015 1020 


Val Ala Leu Met Ala Gly He Pro Asn Gin Val Val Glu Thr Ala 

1025 1030 1035 


Ser Gly Ala Ala Gin Ala Met Lys Arg Ser He Gly Glu Asn Phe 
1040 1045 1050 


Lys Ser Ser Glu Leu Arg Ser Glu Phe Ser Ser Leu His Glu Asp 
1055 1060 1065 


Trp Leu Lys Ser Leu Val Gly He Ser Arg Val Ala His Asn Asn 
1070 1075 1080 
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Ala Pro He Gly Glu Asp Asp Tyr Asp Thr Leu Phe Cys Leu Trp 
1085 1090 1095 

His Glu He Lys Ser Ser Tyr Cys Val Pro Lys 
1100 1105 

<210=. 36 

<211> 1170 

<212> DNA 

<213> Homo sapiens 

<400> 36 

atggcgcaac caaagcaaga gagggtggcg cgtgccagac accaacggtc ggaaaccgcc 60 

agacaccaac ggtcggaaac cgccaagaca ccaacgctcg gaaaccgcca gacaccaacg 120 

ctcggaaacc gccagacacc aaggctcgga atccacgcca ggccacgacg gagggcgact 180 

acctcccttc tgaccctgct gctggcgttc ggaaaaaacg cagtccggtg tgctctgatt 240 

ggtccaggct ctttgacgtc acggactcga cctttgacag agccactagg cgaaaaggag 300 

agacgggaag tatttttfccc gccccgcccg gaaagggtgg agcacaacgt cgaaagcagc 360 

cgttgggagc ccaggaggcg gggcgcctgt gggagccgtg gagggaactt tcccagtccc 420 

cgaggcggat ccggtgttgc atccttggag cgagctgaga actcgagtac agaacctgct 4B0 

aaggccatca aacctattga tcggaagtca gtccatcaga tttgctctgg gccggtggta 540 

ccgagtctaa ggccgaatgc ggtgaaggag ttagtagaaa acagtctgga tgctggtgcc 600 

actaatgttg atctaaagct taaggactat ggagtggatc tcattgaagt ttcaggcaat 660 

ggatgtgggg tagaagaaga aaacttcgaa ggctttactc tgaaacatca cacatgtaag 720 

attcaagagt ttgccgacct aactcaggtg gaaacttttg gctttcgggg ggaagctctg 780 

agctcacttt gtgcactgag tgatgtcacc atttctacct gccgtgtatc agcgaaggtt 840 

gggactcgac tggtgtttga tcactatggg aaaatcatcc agaaaacccc ctacccccgc 900 

cccagaggga tgacagtcag cgtgaagcag ttattttcta cgctacctgt gcaccataaa 960 

gaatttcaaa ggaatattaa gaagaaacgt gcctgcttcc ccttcgcctt ctgccgtgat 1020 

tgtcagtttc ctgaggcctc cccagccatg cttcctgtac agcctgtaga actgactcct 1080 

agaagtaccc caccccaccc ctgctccttg gaggacaacg tgatcactgt attcagctct 1140 

gtcaagaatg gtccaggttc ttctagatga 1170 

<210> 37 

<211> 389 

<212> PRT 

<213> Homo sapiens 

<400> 37 
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Met Ala Gin Pro Lys Gin Glu Arg Val Ala Arg Ala Arg His Gin Arg 
15 10 15 

Ser Glu Thr Ala Arg His Gin Arg Ser Glu Thr Ala Lys Thr Pro Thr 
20 25 30 

Leu Gly Asn Arg Gin Thr Pro Thr Leu Gly Asn Arg Gin Thr Pro Arg 
35 40 45 

Leu Gly lie His Ala Arg Pro Arg Arg Arg Ala Thr Thr Ser Leu Leu 
50 55 50 

Thr Leu Leu Leu Ala Phe Gly Lys Asn Ala Val Arg Cys Ala Leu lie 


Gly Pro Gly Ser Leu Thr Ser Arg Thr Arg Pro Leu Thr Glu Pro Leu 
85 90 95 


Gly Glu Lys Glu Arg Arg Glu Val Phe Phe Pro Pro Arg Pro Glu Arg 
100 105 110 

Val Glu His Asn Val Glu Ser Ser Arg Tip Glu Pro Arg Arg Arg Gly 
115 120 125 

Ala Cys Gly Ser Arg Gly Gly Asn Phe Pro Ser Pro Arg Gly Gly Ser 
130 135 140 


Gly Val Ala Ser Leu Glu Arg Ala Glu Asn Ser Ser Thr Glu Pro Ala 
145 150 155 160 


Lys Ala He Lys Pro He Asp Arg Lys Ser Val His Gin He Cys Ser 
165 170 175 

Gly Pro Val Val Pro Ser Leu Arg Pro Asn Ala Val Lys Glu Leu Val 
180 185 190 

Glu Asn Ser Leu Asp Ala Gly Ala Thr Asn Val Asp Leu Lys Leu Lys 
195 200 205 


Asp Tyr Gly Val Asp Leu He Glu Val Ser Gly Asn Gly Cys Gly Val 
210 215 220 


Glu Glu Glu Asn Phe Glu Gly Phe Thr Leu Lys His His Thr Cys Lys 
225 230 235 240 


He Gin Glu Phe Ala Asp Leu Thr Gin Val Glu Thr Phe Gly Phe Arg 
245 250 255 
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Gly G2u Ala Leu Ser Ser Leu Cys Ala Leu Ser Asp Val Thr He Ser 
260 265 270 


Thr Cys Arg Val Ser Ala Lys Val Gly Thr Arg Leu Val Phe Asp His 
275 280 285 


Tyr Gly Lys He He Gin Lys Thr Pro Tyr Pro Arg Pro Arg Gly Met 
290 295 300 


Thr Val Ser Val Lys Gin Leu Phe Ser Thr Leu Pro Val His His Lys 
305 310 315 320 


Glu Phe Gin Arg Asn He Lys Lys Lys Arg Ala Cys Phe Pro Phe Ala 
325 330 335 


Phe Cys Arg Asp Cys Gin Phe Pro Glu Ala Ser Pro Ala Met Leu Pro 
340 345 350 


Val Gin Pro Val Glu Leu Thr Pro Arg Ser Thr Pro Pro His Pro Cys 
355 360 365 


Ser Leu Glu Asp Asn Val He Thr Val Phe Ser Ser Val Lys Asn Gly 
370 375 380 


Pro Gly Ser Ser' Arg 
385 


<210> 38 

<2H> 795 

<212> DNA 

<213> Homo sapiens 

<400> 38 

atgtgtcctt ggcggcctag actaggccgt cgctgtatgg tgagccccag ggaggcggat 60 

ctgggccccc agaaggacac ccgcctggat ttgccccgta gcccggcccg ggcccctcgg 120 

gagcagaaca gccttggtga ggtggacagg aggggacctc gcgagcagac gcgcgcgcca 180 

gcgacagcag ccccgccccg gcctctcggg agccgggggg cagaggctgc ggagccccag 240 

gagggtctat cagccacagt ctctgcatgt ttccaagagc aacaggaaat gaacacattg 300 

caggggccag tgtcattcaa agatgtggct gtggatttca cccaggagga gtggcggcaa 360 

ctggaccctg atgagaagat agcatacggg gatgtgatgt tggagaacta cagccatcta 420 

gtttctgtgg ggtatgatta tcaccaagcc aaacatcatc atggagtgga ggtgaaggaa 480 

gtggagcagg gagaggagcc gtggataatg gaaggtgaat ttccatgtca acatagtcca 540 

gaacctgcta aggccatcaa acctattgat cggaagtcag tccatcagat ttgctctggg 600 

ccagtggtac tgagtctaag cactgcagtg aaggagttag tagaaaacag tctggatgct 660 
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ggtgccacta atattgatct aaagcttaag gactatggag tggatctcat tgaagtttca 720 
gacaatggat gtggggtaga agaagaaaac tttgaaggct taatctcttt cagctctgaa 780 
acatcacaca tgtaa 795 

<210> 39 

<211> 264 

<212> PRT 

<213> Homo sapiens 

<400> 39 

Met Cys Pro Trp Arg Pro Arg Leu Gly Arg Arg Cys Met Val Ser Pro 
15 10 15 

Arg Glu Ala Asp Leu Gly Pro Gin Lys Asp Thr Arg Leu Asp Leu Pro 


Arg Ser Pro Ala Arg Ala Pro Arg Glu Gin Asn Ser Leu Gly Glu Val 
35 40 45 


Asp Arg Arg Gly Pro Arg Glu Gin Thr Arg Ala Pro Ala Thr Ala Ala 


Pro Pro Arg Pro Leu Gly Ser Arg Gly Ala Glu Ala Ala Glu Pro Gin 
65 70 75 80 


Glu Gly Leu Ser Ala Thr Val Ser Ala Cys Phe Gin Glu Gin Gin Glu 
85 90 95 


Met Asn Thr Leu Gin Gly Pro Val Ser Phe Lys Asp Val Ala Val Asp 
100 105 110 


Phe Thr Gin Glu Glu Trp Arg Gin Leu Asp Pro Asp Glu Lys lie Ala 
115 120 125 


Tyr Gly Asp Val Met Leu Glu Asn Tyr Ser His Leu Val Ser Val Gly 
130 135 140 


Tyr Asp Tyr His Gin Ala Lys His His His Gly Val Glu Val Lys Glu 
145 150 155 160 


Val Glu Gin Gly Glu Glu Pro Trp lie Met Glu Gly Glu Phe Pro Cys 
165 170 175 


Gin His Ser Pro Glu Pro Ala Lys Ala lie Lys Pro lie Asp Arg Lys 
180 185 190 


Ser Val His Gin lie Cys Ser Gly Pro Val Val Leu Ser Leu Ser Thr 
195 200 205 
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Ala Val Lys Glu Leu Val Glu Asn Ser Leu Asp Ala Gly Ala Thr Asn 
210 215 220 


lie Asp Leu Lys Leu Lys Asp Tyr Gly Val Asp Leu lie Glu Val Ser 

225 230 235 240 


Phe Ser Ser Glu Thr Ser His Met 
260 
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